Skip to content

ronan@ronanmcgovern.com

What Makes a Great Language Model?

*Isn’t it surprising…?*

…that training a language model on the web’s data works? Isn’t there a lot of rubbish on the web? How does the model know what information to focus on?

“Correct” or “high quality” data has a tendency to repeat itself. There might be some data saying that the sun moves around the earth, but there is much more saying that the earth moves around the sun. This makes it possible for us to train language models on large datasets, even if they contain some, or even a lot, of weak information or arguments. Weak arguments and information tend to differ from each other, whereas stronger arguments or information tends to be articulated, transmitted and replicated more coherently…

Tales from Türkiye

What is today called Turkey (the Republic of Türkiye) was founded as a secular state in 1923 by Atatürk. The man’s face adorned every second building in Turkey as we approached the 100 year anniversary on October 29th 2023.