Size of Wikipedia
English Wikipedia has 6,841,238 articles with 4.5 billion words, 60,922,102 pages, and a growth rate of 14,000 articles monthly. Word count per article averages 668, showing consistent growth.
Read original articleThe English Wikipedia's size is measured by the number of articles, words, pages, and database size. As of June 26, 2024, it contains 6,841,238 articles with over 4.5 billion words, averaging about 668 words per article, and a total of 60,922,102 pages. The growth rate shows an increase of about 14,000 articles per month as of January 2024. While the number of new articles added monthly has decreased since 2006, the amount of text added annually remains constant, indicating more content is added to existing articles. Bot-generated Wikipedias like Cebuano grow faster but have shorter articles compared to human-written ones like English Wikipedia. The growth model of English Wikipedia articles has shown a consistent rate of growth over the years. The total number of words in English Wikipedia articles has steadily increased, with over 4.5 billion words as of February 2024, averaging about 670 words per article. Monthly statistics since January 2019 show a continuous increase in word count.
Related
Wc2: Investigates optimizing 'wc', the Unix word count program
The GitHub project "wc2" presents an innovative algorithm for the `wc` program, focusing on asynchronous state-machine parsing in C and JavaScript. It enhances efficiency, scalability, and speed compared to traditional `wc` implementations.
Delving into ChatGPT usage in academic writing through excess vocabulary
A study by Dmitry Kobak et al. examines ChatGPT's impact on academic writing, finding increased usage in PubMed abstracts. Concerns arise over accuracy and bias despite advanced text generation capabilities.
Words you can spell with a calculator (2005)
The website paperlined.org hosts a word frequency list last updated in 2005. It includes words like bee, bell, bible, egg, glee, ill, leg, zoo, and names like Bob, Hill, Zoe.
Wikipedia: 97% of all articles lead to Philosophy
The "Getting to Philosophy" phenomenon on Wikipedia involves navigating articles by clicking the first non-parenthesized, non-italicized link, often leading to the Philosophy article. This trend, starting around 2008, saw a decrease in success rates in May 2024 due to a loop between Awareness and Psychology.
ESM3, EsmGFP, and EvolutionaryScale
EvolutionaryScale introduces ESM3, a language model simulating 500 million years of evolution. ESM3 designs proteins with atomic precision, including esmGFP, a novel fluorescent protein, showcasing its potential for innovative protein engineering.
Related
Wc2: Investigates optimizing 'wc', the Unix word count program
The GitHub project "wc2" presents an innovative algorithm for the `wc` program, focusing on asynchronous state-machine parsing in C and JavaScript. It enhances efficiency, scalability, and speed compared to traditional `wc` implementations.
Delving into ChatGPT usage in academic writing through excess vocabulary
A study by Dmitry Kobak et al. examines ChatGPT's impact on academic writing, finding increased usage in PubMed abstracts. Concerns arise over accuracy and bias despite advanced text generation capabilities.
Words you can spell with a calculator (2005)
The website paperlined.org hosts a word frequency list last updated in 2005. It includes words like bee, bell, bible, egg, glee, ill, leg, zoo, and names like Bob, Hill, Zoe.
Wikipedia: 97% of all articles lead to Philosophy
The "Getting to Philosophy" phenomenon on Wikipedia involves navigating articles by clicking the first non-parenthesized, non-italicized link, often leading to the Philosophy article. This trend, starting around 2008, saw a decrease in success rates in May 2024 due to a loop between Awareness and Psychology.
ESM3, EsmGFP, and EvolutionaryScale
EvolutionaryScale introduces ESM3, a language model simulating 500 million years of evolution. ESM3 designs proteins with atomic precision, including esmGFP, a novel fluorescent protein, showcasing its potential for innovative protein engineering.