August 28th, 2024

Meta is accused of "bullying" the open-source community

Meta faces accusations of "bullying" the open-source community while trying to establish its AI models as the standard, creating a divide between purists and users attracted to its offerings.

Read original articleLink Icon
Meta is accused of "bullying" the open-source community

Meta is facing accusations of "bullying" the open-source community as it seeks to establish its artificial intelligence models as the standard in the field. The situation is likened to a scenario where a large corporation takes over a long-standing communal space, imposing new norms that clash with the existing culture. Open-source purists are expressing their concerns over Meta's approach, which they perceive as undermining the foundational principles of open-source collaboration. Despite the objections from these purists, many users are still drawn to Meta's offerings, indicating a divide within the community. The implications of Meta's actions could significantly impact the future of open-source AI development and the dynamics between large corporations and independent developers.

- Meta is accused of undermining the open-source community's principles.

- The company's efforts aim to set a new standard for AI models.

- There is a growing divide between open-source purists and users attracted to Meta's offerings.

- The situation reflects broader tensions between large corporations and independent developers in the tech space.

Link Icon 11 comments
By @Chance-Device - 6 months
Maybe by bureaucrats in the OSI. Meta through the Llama models have done more for open source LLMs than just about anyone else, which the community recognises.

The perfect is the enemy of the good, as usual.

By @echelon - 6 months
By @kdhanx - 6 months
Nice that the Economist picks this up. In addition to the article, which is 100% correct, Meta sucks the air out of the room with PyTorch, which stifles other true open source efforts. Instagram has way too much influence on Python, where a couple of companies have wrestled away control over the org from dozens of independent developers and push their pet projects of questionable quality.

All of this needs to stop.

By @psunavy03 - 6 months
Thank you, author, for starting your article off with a mental picture I decidedly did NOT need.
By @impure - 6 months
I'm a little confused what the opening paragraph has to do with the rest of the article.

The OSI's definition is still new. In fact it's still in draft form so this debate is a bit premature. I suspect that companies will begin releasing the code to train the model once it is finalized (they will never tell you the training data due to legal reasons and forcing them to is a losing battle).

By @ponty_rick - 6 months
"Which raises the tantalising question: will Zuck ever have the pluck to bare it all?"

This last sentence gave me a headache. He does not need to publish a bunch of possibly proprietary data owned by Meta, so no he won't bare it all. Data isn't free.

By @Manuel_D - 6 months
What are the consequences as far as releasing the weights but not the training data? From what I understand, Llama and other open-weight models can be freely used and modified. What can people not do presently, but could do with an open-data model?
By @roflchoppa - 6 months
Love the Hawaii reference in the opener. Cheeky :)
By @animitronix - 6 months
Lol, did they break someone's "code of conduct"?
By @mrbluecoat - 6 months
Just can't take an article seriously when it starts with "Meta, the social-media giant controlled by a mankini-clad Mark Zuckerberg."
By @darksaints - 6 months
I find this obsession with distinguishing between open weight and open source foolish and counterproductive.

The model architecture and infrastructure are open source. That is what matters.

The fact that you get really good weights that result from millions of dollars of GPU time on extremely expensive-to-procure proprietary datasets is amazing, but even that shouldn't be a requirement to call this open source. That is literally just an output of the open source model trained on non-open source inputs.

I find it absurd that if I create a model architecture, publish my source code, and slap an open source license on it, I can call that open source…but the moment I publish some weights that are the result of running the program on some proprietary dataset, all of a sudden I can’t call it open source anymore.