Ars Technica→ original

Wikipedia opens content to AI companies through licensing agreements

Wikimedia Enterprise заключила лицензионные соглашения с крупными ИИ-компаниями, включая Microsoft, Meta и Amazon. Теперь они смогут использовать контент Википе

AI-processed from Ars Technica; edited by Hamidun News
Wikipedia opens content to AI companies through licensing agreements
Source: Ars Technica. Collage: Hamidun News.
◐ Listen to article

Wikipedia, one of the world's largest sources of knowledge, is opening its doors to artificial intelligence. Wikimedia Enterprise, the commercial division of the Wikimedia Foundation, has concluded a series of licensing agreements with technology giants such as Microsoft, Meta, Amazon, as well as promising startups Perplexity AI and Mistral AI. These agreements provide AI companies with access to Wikipedia's vast database for training and improving their models.

For a long time, Wikipedia remained a free and open resource available to everyone. However, with the growing popularity of large language models (LLMs), demand for high-quality data to train them has increased sharply. Wikimedia Enterprise saw an opportunity to monetize its content while maintaining the principles of openness and accessibility that underpin Wikipedia. Paid licenses provide companies with structured and optimized access to data, as well as technical support, which significantly simplifies the process of training AI models.

What does this mean for the AI industry? First, it provides access to a massive volume of verified and structured information, which is critical for training quality and reliable LLMs. Wikipedia contains millions of articles in various languages, covering a wide range of topics – from history and science to culture and technology. Using this data will help AI models better understand the world and generate more relevant and accurate answers. Second, licensing agreements provide a sustainable source of funding for the Wikimedia Foundation, allowing the organization to continue maintaining and developing Wikipedia as a global knowledge resource.

However, this step carries potential risks. It is important that the use of Wikipedia content complies with the principles of neutrality and objectivity underlying the encyclopedia. We must prevent situations where AI models trained on Wikipedia data spread misinformation or biased opinions. The Wikimedia Foundation must carefully monitor the use of its content and respond quickly to any violations. Additionally, it is important to ensure transparency regarding which Wikipedia data is used to train various AI models.

The conclusion of licensing agreements with AI companies is an important step for Wikipedia and the entire artificial intelligence industry. It opens new opportunities for AI development, but requires a responsible approach and adherence to the principles of openness and neutrality. In the future, we will likely see other major data sources follow Wikipedia's example and begin monetizing their content for AI training. This could lead to the formation of a new market for AI data, which in turn would have a significant impact on the development of artificial intelligence technologies.

ZK
Hamidun News
AI news without noise. Daily editorial selection from 400+ sources. A product by Zhemal Khamidun, Head of AI at Alpina Digital.

Want to stop reading about AI and start using it?

AI News is a curated feed of AI/tech news. Hamidun Academy teaches you to use AI systematically in your work.

What do you think?
Loading comments…