August 23rd, 2024

HuggingFace to Replace Git LFS with Xet

Hugging Face has acquired XetHub to enhance AI development, improving storage and versioning for large files, facilitating efficient updates, and supporting growth in the AI community and infrastructure team.

Read original articleLink Icon
HuggingFace to Replace Git LFS with Xet

Hugging Face has officially acquired XetHub, a Seattle-based company focused on enhancing software engineering practices for AI development. Founded by former Apple employees Yucheng Low, Ajit Banerjee, and Rajat Arya, XetHub specializes in technologies that allow Git to manage large repositories and facilitate collaboration on extensive datasets and models. The integration of XetHub's technology is expected to significantly improve Hugging Face's storage and versioning capabilities, moving away from Git LFS to a more optimized solution tailored for the large files typical in AI. This transition will enable more efficient updates to large files, such as only re-uploading modified chunks rather than entire datasets. With the growing complexity of AI models, including those with trillions of parameters, the collaboration challenges will also be addressed through enhanced tools for understanding data and model evolution. The acquisition aims to bolster Hugging Face's growth in datasets and models over the next five years, while also expanding their infrastructure team.

- Hugging Face has acquired XetHub to enhance AI development practices.

- XetHub's technology will improve storage and versioning for large AI files.

- The integration aims to facilitate more efficient updates to datasets and models.

- The acquisition is expected to support Hugging Face's growth in the AI community.

- Hugging Face is hiring for its infrastructure team to support this initiative.

Related

Big Tech's playbook for swallowing the AI industry

Big Tech's playbook for swallowing the AI industry

Amazon strategically hires Adept AI team to sidestep antitrust issues. Mimicking Microsoft's Inflection move, Amazon's "reverse acquihire" trend absorbs AI startups, like Adept, facing financial struggles. Big Tech adapts to regulatory challenges by emphasizing talent acquisition and tech licensing.

Apple just launched a public demo of its '4M' AI model

Apple just launched a public demo of its '4M' AI model

Apple publicly launches its '4M' AI model with EPFL on Hugging Face Spaces, showcasing versatile capabilities across modalities. The move signals a shift towards transparency, aligning with market growth and emphasizing user privacy amid ethical concerns.

HuggingFace releases support for tool-use and RAG models

HuggingFace releases support for tool-use and RAG models

The GitHub repository of Hugging Face Transformers provides details on a versatile library for NLP, computer vision, and audio tasks. Users can access it for learning and implementation. For more information, inquire within.

AI Humane execs leave company to found AI fact-checking startup

AI Humane execs leave company to found AI fact-checking startup

Two former Humane executives, Brooke Hartley Moy and Ken Kocienda, establish Infactory, a startup focusing on AI fact-checking. They emphasize accurate data sources, subscription pricing for enterprise customers, and future seed funding. Launch imminent.

HuggingFace: Scaling open source community 300%/yr with No OKRs and No Meetings

HuggingFace: Scaling open source community 300%/yr with No OKRs and No Meetings

HuggingFace, under Chief Llama Officer Omar Sanseviero, grew its open-source ML community from 780k to 2.3m repos in a year. They prioritize community collaboration, metrics, and engagement over revenue, with a decentralized structure and no OKRs.

Link Icon 5 comments
By @Kelteseth - 6 months
Git Repo: https://github.com/xetdata/xet-core

Sounds promising. Git LFS works ok ish, had often issues with it.

By @wdkrnls - 6 months
Getting git to handle large data sets in regular repositories seems like a great way to go for many reasons. What is preventing the ideas of Xet from landing into git directly?
By @hhthrowaway1230 - 6 months
The docs say:

> it can also be configured against a local data store for a fully free and open source route to managing large repositories.

I can't find the docs for hosting oneself is that maybe available?

By @thelastparadise - 6 months
does this break huggingface-cli download??