🚀💳 Trustee Plus - більше ніж криптогаманець з європейською платіжною карткою. Спробуй 👉

Court documents reveal Meta's secret experiments in AI training

The high-profile lawsuit against Meta has revealed a wealth of internal company documents. One document in particular has caught the attention of some AI researchers: it details a method for improving Llama’s AI models.

Leave a comment
Court documents reveal Meta's secret experiments in AI training

The high-profile lawsuit against Meta has revealed a wealth of internal company documents. One document in particular has caught the attention of some AI researchers: it details a method for improving Llama’s AI models.

These court documents describe how Meta researchers used a process called ablation to determine what data helped improve Llama's AI models, Business Insider writes .

Ablation is a medical technique that purposefully destroys tissue to improve brain function. In AI, it involves removing parts of a system to study how those components affect performance.

In Meta’s ablation experiments, the company replaced some of the AI ​​training data with pirated books from the giant LibGen database. The company then retrained its Llama model to see how that affected the results.

We wrote about the Russian pirate library LibGen earlier. Meta tried to prevent publicity by filing a lawsuit about the fact that the company used LibGen to train AI. The case concerned copyright infringement, the so-called “Kadrey v. Meta”. In addition, it later turned out that Meta may have trained its AI models on unpublished books .

In one experiment, Meta added science and technology books as well as fiction books to the training data. In a second experiment, it added only fiction books.

In both experiments, Llama's performance improved noticeably in industry benchmarks, according to an internal Meta document (p. 18-19).

This suggests that Meta has the ability to assign meaning to specific learning data,” says Nick Vincent, an associate professor in the School of Computer Science at Simon Fraser University.

For example, one Meta engineer on LinkedIn mentions performing over 100 ablations during the development of Llama 4 and previous versions of the company’s large AI models.

Meta does not publish the results of these experiments, and other AI companies also keep this information secret, Vincent said.

One possible reason: If tech giants tell the world exactly what training data helped their AI models, the creators of that information will want to get paid—and they can calculate how much money they're owed.

The release of the results of ablation experiments could also affect the serious copyright lawsuits raging in the tech industry — a good example is this case, "Kadrey v. Meta."

In such cases, tech giants and AI startups argue that “training” machines based on material published online is not copyright infringement. And such internal documents that determine the value of certain content can work against them.

Secret Meta Ablation Results

Meta’s ablation experiments focus on this first stage of learning, which uses mountains of data to help models understand the world. For example, to teach a machine to recognize a llama, you need to show it as many pictures of llamas and alpacas as possible so that it can tell the difference between the two animals.

Meta’s first ablation experiment showed that adding science, technology, and fiction books to the training data improved Llama’s performance by 4.5% on the industry benchmark BooIQ. Adding fiction books alone resulted in a 6% improvement.

BoolQ is a test of 15,942 yes/no questions that AI models must answer. The more questions they answer, the better their performance. A 5% improvement is equivalent to answering almost 800 additional questions correctly.

According to an internal Meta document, the performance gain from these ablation experiments was 5.5% on another test known as SIQA.

Peter Henderson, an associate professor of computer science at Princeton, posted several diagrams from the court document on Twitter that demonstrate these achievements.

While a performance gain of around 5% seems small, in the AI ​​race, any advantage is important.

“It’s actually a lot, because it’s very difficult to get every extra point on AI tests,” said Bill Gross, CEO of ProRata, a startup that tries to compensate creators for their contributions to AI.

Meta introduces a new generation of open AI models: Llama 4. Context window for 10 million tokens. Comparison with competitors
Meta introduces a new generation of open AI models: Llama 4. Context window for 10 million tokens. Comparison with competitors
On the topic
Meta introduces a new generation of open AI models: Llama 4. Context window for 10 million tokens. Comparison with competitors
Meta provided the Ministry of Digital Economy with consultants and developers to create a Ukrainian “national language model” based on Llama
Meta provided the Ministry of Digital Economy with consultants and developers to create a Ukrainian “national language model” based on Llama
On the topic
Meta provided the Ministry of Digital Economy with consultants and developers to create a Ukrainian “national language model” based on Llama
A vulnerability has been discovered in Meta's Llama framework that exposes AI systems to risks of remote code execution
A vulnerability has been discovered in Meta's Llama framework that exposes AI systems to risks of remote code execution
On the topic
A vulnerability has been discovered in Meta's Llama framework that exposes AI systems to risks of remote code execution
Read the country's main IT news in our Telegram
Read the country's main IT news in our Telegram
On the topic
Read the country's main IT news in our Telegram
Also Read
Roosh запускає нову освітню платформу AI HOUSE CLUB для ML/AI-спеціалістів та дата сайнтистів. Розповідаємо, як подати заявку та чому навчатимуть
Roosh запускає нову освітню платформу AI HOUSE CLUB для ML/AI-спеціалістів та дата сайнтистів. Розповідаємо, як подати заявку та чому навчатимуть
Roosh запускає нову освітню платформу AI HOUSE CLUB для ML/AI-спеціалістів та дата сайнтистів. Розповідаємо, як подати заявку та чому навчатимуть
Жодних ігор у метавсесвіті: Facebook припинить підтримку свого сервісу для геймерів
Жодних ігор у метавсесвіті: Facebook припинить підтримку свого сервісу для геймерів
Жодних ігор у метавсесвіті: Facebook припинить підтримку свого сервісу для геймерів
«В жовтні випускаємо VR-шолом для аватарів, в «чіпування» Neuralink Маска вірю мало». Про що глава Meta Цукерберг 3 години говорив в подкасті Джо Рогана
«В жовтні випускаємо VR-шолом для аватарів, в «чіпування» Neuralink Маска вірю мало». Про що глава Meta Цукерберг 3 години говорив в подкасті Джо Рогана
«В жовтні випускаємо VR-шолом для аватарів, в «чіпування» Neuralink Маска вірю мало». Про що глава Meta Цукерберг 3 години говорив в подкасті Джо Рогана
25 серпня вийшла чергова серія популярного подкасту The Joe Rogan Experience, гостем якого став глава компанії Meta Марк Цукерберг. Розповідаємо про головне з майже 3-годинного інтерв’ю.
Як нейромережі бачать вільну та незалежну Україну? Тест dev.ua
Як нейромережі бачать вільну та незалежну Україну? Тест dev.ua
Як нейромережі бачать вільну та незалежну Україну? Тест dev.ua
Нейронні мережі для генерації зображень бачать світ по-своєму, їхню логіку зрозуміти часом зовсім неможливо. Але таки хочеться. На честь Дня Незалежності України редакція dev.ua вирішила провести невеликий експеримент. Ми задали чотирьом різним нейронним мережам п’ять однакових запитів: «прапор України», «День Незалежності України», «український Крим», «перемога України» та «українці». Отриманими результатами ми ділимося з вами нижче.

Have important news to share? Message our Telegram bot

Key events and useful links in our Telegram channel

Discussion
No comments yet.