The thing is that the Wiki mods will need to be more diligent with uncited things. I also see 2 massive opportunities here. First is that they can have agents check the cited source and verify whether the source backs up what's said to a reasonable degree. Second opportunity is fitting in things only found in other language Wikis that either be incorporated into the english one or help introduce new articles. Believe it or not, LLMs can't generate english answers for things answered only in Russian (or any language) in the training data.
> First is that they can have agents check the cited source and verify whether the source backs up what's said to a reasonable degree.
This is a hard and tmk unsolved NLP/IR problem, and data access is an issue.
> Second opportunity is fitting in things only found in other language Wikis that either be incorporated into the english one or help introduce new articles.
This has been attempted via machine translation in the past, and it failed because you need native speakers to verify and correct the translations and this wasn't the sort of work that people were jumping to volunteer to do.
I speak multiple languages. It'd be fun to be given a link saying "this article has content present that is not in the English version" and see if I can update either of the articles using what's in the other copy. But I'm not going to go read wikipedia articles in two languages to find them.
>>LLMs can't generate english answers for things answered only in Russian in the training data.
For multilingual LLM’s? Why do you think that?
An LLM can translate inputs of arbitrary Russian text. If there were an English question about something only in the training data as Russian, I would expect an answer - with the quality being on par with its general translation capabilities.