Show HN: R2R V2 – A open source RAG engine with prod features
The R2R GitHub repository offers an open-source RAG answer engine for scalable systems, featuring multimodal support, hybrid search, and a RESTful API. It includes installation guides, a dashboard, and community support. Developers benefit from configurable functionalities and resources for integration. Full documentation is available on the repository for exploration and contribution.
Read original articleThe R2R GitHub repository hosts an open-source RAG answer engine aimed at facilitating the transition from local LLM experimentation to scalable RAG systems. It offers a robust RAG system with features like multimodal support, hybrid search, and graph RAG, all accessible through a RESTful API. The repository contains installation instructions, quickstart guides, a user-friendly dashboard, and community support details. Developers can benefit from configurable and extensible functionalities, along with client-server support for seamless integration. Additionally, the repository provides resources such as documentation, cookbooks, and examples to aid in utilizing various R2R features and integrations. For those interested in exploring further or contributing, the full documentation is available on the R2R GitHub Repository.
Related
AI-powered conversion from Enzyme to React Testing Library
Slack engineers transitioned from Enzyme to React Testing Library due to React 18 compatibility issues. They used AST transformations and LLMs for automated conversion, achieving an 80% success rate.
OSRD: Open-Source Railway Designer
The OSRD is an open-source web app for railway planning, capacity analysis, and timetabling. It supports infrastructure design, conflict detection, and automatic train addition. The project promotes open-source development and interoperability.
React Lua
The GitHub repository focuses on translating ReactJS 17.x into Lua for Roblox and Lua community. It aims for performance and accuracy, welcoming contributions from notable developers like Paul Doyle and Matt Hargett.
Show HN: FiddleCube – Generate Q&A to test your LLM
FiddleCube on GitHub helps create question-answer datasets for Large Language Models. It includes a guide, examples, and details on generating ideal datasets for testing, evaluating, and training LLMs. For more information, visit the GitHub page.
Show HN: a Rust lib to trigger actions based on your screen activity (with LLMs)
The GitHub project "Screen Pipe" uses Large Language Models to convert screen content into actions. Implemented in Rust + WASM, inspired by `adept.ai`, `rewind.ai`, and `Apple Shortcut`. Open source under MIT license.
We have a customer who has hundreds of thousands of unstructured and diverse PDFs (containing tables, forms, checkmarks, images, etc.), and they need to accurately convert these PDFs into markdown for RAG usage.
Traditional OCR approaches fall short in many of these cases, so we've started using a combined multimodal LLM + OCR approach that has led to promising accuracy and consistency at scale (ping me if you want to give this a try). The RAG system itself is not a big pain point for them, but the accurate and efficient extraction and structuring of the data is.
I am leaving my position, and I recommended this to basically replace me with a junior dev who can just hit the API endpoints.
If I want to use dashboard I have to clone another repo? 'git clone git@github.com:SciPhi-AI/R2R-Dashboard.git' ? why not make it available in a docker container so that if im only interested in rag I can plug into the docker container for dashboard?
This project feels like a collection of alot of things thats not really providing any extra ease to development. It feels more like joining a new company and trying to find out all the repo and set everything up.
This really looks cool, but Im struggling to figure out if its a SDK or suite of apps or both but in the later case the suite of apps is really confusing if i have to still write all the python, then it feels more like a SDK?
Perhaps provide better "1 click" install experience to preview/show case all the features and then let devs leverages the r2r lalter...
It really seems like document chunking is not a problem that can be solved well generically. And RAG really hinges on which documents get retrieved/the correct metadata.
Current approaches around this seem to be using a ReRanker, where we fetch a ton of information and prune it down. But still, document splitting, is tough. Especially when you start to add transcripts of video that can be a few hours long.
Can R2R be built with all processing steps implementing local "open" models?
How do I do a bulk/batch ingest of say, 10k html documents into this system?
"List all YC founders that worked at Google and now have an AI startup."
How to check the accuracy of the answers? Is there some kind of a detailed trace of how the answer was generated?
Lightweight is good, and running it without having to deal with Docker is excellent.
But your quickstart guide is still huge! It feels very much not "quick". How do you:
* Install via Python
* Throw a folder of documents at it
* Have it set there providing a REST API to get results?
Eg suppose I have an AI service already, so I throw up a private Railway instance of this as a Python app. There's a DB somewhere. As simple as possible. I can mimic it at home just running a local Python server. How do I do that? _That's_ the real quickstart.
Related
AI-powered conversion from Enzyme to React Testing Library
Slack engineers transitioned from Enzyme to React Testing Library due to React 18 compatibility issues. They used AST transformations and LLMs for automated conversion, achieving an 80% success rate.
OSRD: Open-Source Railway Designer
The OSRD is an open-source web app for railway planning, capacity analysis, and timetabling. It supports infrastructure design, conflict detection, and automatic train addition. The project promotes open-source development and interoperability.
React Lua
The GitHub repository focuses on translating ReactJS 17.x into Lua for Roblox and Lua community. It aims for performance and accuracy, welcoming contributions from notable developers like Paul Doyle and Matt Hargett.
Show HN: FiddleCube – Generate Q&A to test your LLM
FiddleCube on GitHub helps create question-answer datasets for Large Language Models. It includes a guide, examples, and details on generating ideal datasets for testing, evaluating, and training LLMs. For more information, visit the GitHub page.
Show HN: a Rust lib to trigger actions based on your screen activity (with LLMs)
The GitHub project "Screen Pipe" uses Large Language Models to convert screen content into actions. Implemented in Rust + WASM, inspired by `adept.ai`, `rewind.ai`, and `Apple Shortcut`. Open source under MIT license.