
Saturday, April 03, 2021

Building Multilingual, Multipurpose, Multi Context Wikipedias

I am a long-time user and supporter of Wikipedia, and I have run into the problem this article describes: articles on the same topic vary in their knowledge content from one language edition to another. In too many cases there is a useful, detailed article in one language while the same topic goes uncovered in another, and that is across some 50 million articles in 300 languages. We ran into related problems when we scoped out a wikipedia for a large corporation, so I expanded the title of this post to include those other aspects. The article referenced below, though, uses language as its example of the knowledge-sharing problem. It is a tough problem; let me know if you solve it effectively.

Building a Multilingual Wikipedia, by Denny Vrandečić, in CACM

Communications of the ACM, April 2021, Vol. 64 No. 4, Pages 38-41  10.1145/3425778

"Wikipedia has more than 50 million articles in approximately 300 languages. The content in these languages is independently created and maintained. The knowledge in Wikipedia is very unevenly distributed over the languages: some languages have more than a million articles, but more than 50 languages have only a few hundred articles or less. More importantly, also the number of contributors is very unevenly distributed: English Wikipedia has more than 418,000 contributors, the second-most active one, Spanish, drops down to 90,000. More than half of language editions have fewer than 10 contributors doing more than four edits per month. To assume that fewer than 10 active contributors can write and maintain a comprehensive encyclopedia in their spare time is optimistic at best.

In order to close these knowledge gaps we are building a multilingual Wikipedia where content is created only once but made available in all languages. The multilingual Wikipedia has two main components: Abstract Wikipedia where the content is created and maintained in a language-independent notation, and Wikifunctions, a project to create, catalog, and maintain functions. For the multilingual Wikipedia, the most important function is one that takes content from Abstract Wikipedia and renders it in natural language, which in turn gets integrated into Wikipedia proper.

This will considerably reduce the effort required to create a comprehensive and maintain a current encyclopedia in many languages. It will allow more people to share more knowledge in more languages than ever before. It will be particularly useful for under-served languages, providing an important way to help improve education and ready access to knowledge in many countries. ... "
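
To make the "create once, render in every language" idea concrete for myself, here is a minimal Python sketch. It is not the notation or API of Abstract Wikipedia or Wikifunctions; the `Instantiation` class, the `LEXICON` table, and the `render_*` functions are all hypothetical names I made up for illustration. The assumption is simply that a statement is stored as language-independent data and that each language supplies its own rendering function.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Tuple


@dataclass(frozen=True)
class Instantiation:
    """Hypothetical language-independent statement: `instance` is a `category`."""
    instance: str  # concept identifier, not a word in any language
    category: str


# Hypothetical lexicon: concept id -> (label, grammatical gender or None).
LEXICON: Dict[str, Dict[str, Tuple[str, Optional[str]]]] = {
    "en": {"nairobi": ("Nairobi", None), "city": ("city", None)},
    "de": {"nairobi": ("Nairobi", None), "city": ("Stadt", "f")},
    "es": {"nairobi": ("Nairobi", None), "city": ("ciudad", "f")},
}


def render_en(s: Instantiation) -> str:
    name, _ = LEXICON["en"][s.instance]
    noun, _ = LEXICON["en"][s.category]
    # Crude a/an choice based on the first letter; enough for a sketch.
    article = "an" if noun[0].lower() in "aeiou" else "a"
    return f"{name} is {article} {noun}."


def render_de(s: Instantiation) -> str:
    name, _ = LEXICON["de"][s.instance]
    noun, gender = LEXICON["de"][s.category]
    # German indefinite article in the nominative depends on noun gender.
    article = {"m": "ein", "f": "eine", "n": "ein"}[gender]
    return f"{name} ist {article} {noun}."


def render_es(s: Instantiation) -> str:
    name, _ = LEXICON["es"][s.instance]
    noun, gender = LEXICON["es"][s.category]
    article = {"m": "un", "f": "una"}[gender]
    return f"{name} es {article} {noun}."


# One renderer per language; the abstract content itself never changes.
RENDERERS: Dict[str, Callable[[Instantiation], str]] = {
    "en": render_en,
    "de": render_de,
    "es": render_es,
}


def render(statement: Instantiation, lang: str) -> str:
    """Render one abstract statement in the requested language."""
    return RENDERERS[lang](statement)


if __name__ == "__main__":
    fact = Instantiation(instance="nairobi", category="city")
    for lang in RENDERERS:
        print(lang, render(fact, lang))
```

The point of the sketch is the division of labor: the fact is entered once, and supporting a new language means adding a lexicon and one rendering function rather than rewriting the article. Real grammar is far messier than an article lookup, which is presumably why the project treats the renderers as a community-maintained library of functions.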
