Korvus: Single-Query RAG with Postgres
Korvus is a search SDK merging RAG pipeline into a Postgres query, using Python, JavaScript, and Rust bindings. It streamlines search processes, minimizes infrastructure needs, and offers detailed documentation on GitHub.
Read original articleKorvus is a search SDK consolidating the RAG pipeline into a single database query, leveraging Postgres with Python, JavaScript, and Rust bindings. It offers efficient search functionalities with reduced infrastructure overhead. The GitHub repository provides comprehensive details on Korvus, covering its features, system architecture, setup instructions, SQL advantages, documentation, community resources, and guidelines for contributions. For more information or specific inquiries about the repository, feel free to seek assistance.
Related
Show HN: R2R V2 – A open source RAG engine with prod features
The R2R GitHub repository offers an open-source RAG answer engine for scalable systems, featuring multimodal support, hybrid search, and a RESTful API. It includes installation guides, a dashboard, and community support. Developers benefit from configurable functionalities and resources for integration. Full documentation is available on the repository for exploration and contribution.
Corcel – Use WordPress backend with Laravel or any PHP application
Corcel is a PHP package integrating WordPress databases with Laravel's Eloquent ORM. It supports various WordPress features, enhancing data retrieval for PHP projects. Inquiries for specific details are welcome.
Redis Alternative at Apache Software Foundation Now Supports RediSearch and SQL
A new query engine, KQIR, supports SQL and RediSearch queries for Apache Kvrocks, a Redis-compatible database. It aims to combine performance with transaction guarantees and complex query support, utilizing an intermediate language for consistency. Future plans include expanding field types and enhancing transaction guarantees.
Open-Source Perplexity – Omniplex
The Omniplex open-source project on GitHub focuses on core functionality, Plugins Development, and Multi-LLM Support. It utilizes TypeScript, React, Redux, Next.js, Firebase, and integrates with services like OpenAI and Firebase. Community contributions are welcomed.
Announcing Polars 1.0 (Blog Post)
Polars releases Python version 1.0 after 4 years, gaining popularity with 27.5K GitHub stars and 7M monthly downloads. Plans include improving performance, GPU acceleration, Polars Cloud, and new features.
We built Korvus, an open-source RAG (Retrieval-Augmented Generation) pipeline that consolidates the entire RAG workflow - from embedding generation to text generation - into a single SQL query, significantly reducing architectural complexity and latency.
Here's some of the highlights:
- Full RAG pipeline (embedding generation, vector search, reranking, and text generation) in one SQL query
- SDKs for Python, JavaScript, and Rust (more languages planned)
- Built on PostgreSQL, leveraging pgvector and pgml
- Open-source, with support for open models
- Designed for high performance and scalability
Korvus utilizes Postgres' advanced features to perform complex RAG operations natively within the database. We're also the developers of PostgresML, so we're big advocates of in-database machine learning. This approach eliminates the need for external services and API calls, potentially reducing latency by orders of magnitude compared to traditional microservice architectures. It's how our founding team built and scaled the ML platform at Instacart.
We're eager to get feedback from the community and welcome contributions. Check out our GitHub repo for more details, and feel free to hit us up in our Discord!
I spent too long reading Python docs because I haven't touched the language since 2019. Happy to help develop a Ruby SDK!
Can it run the LLM on a GPU?
You mention pulling models from huggingface for document embedding. Is it possible to pass an hf token to use private models?
I train domain and language-specific[0] embedding and conversational models and if I can use them in Korvus I'll most likely switch to it overnight.
What am I missing? Honest question. I want to likes this :)
I see you offer re-ranking using local models, will there be build-in support for making re-ranking calls to external services such as cohere in the future?
One question: Can I use an external model (ie get the raw RAG snippets, or prompt text)? Or does it have to be the one specified in Korvus?
Related
Show HN: R2R V2 – A open source RAG engine with prod features
The R2R GitHub repository offers an open-source RAG answer engine for scalable systems, featuring multimodal support, hybrid search, and a RESTful API. It includes installation guides, a dashboard, and community support. Developers benefit from configurable functionalities and resources for integration. Full documentation is available on the repository for exploration and contribution.
Corcel – Use WordPress backend with Laravel or any PHP application
Corcel is a PHP package integrating WordPress databases with Laravel's Eloquent ORM. It supports various WordPress features, enhancing data retrieval for PHP projects. Inquiries for specific details are welcome.
Redis Alternative at Apache Software Foundation Now Supports RediSearch and SQL
A new query engine, KQIR, supports SQL and RediSearch queries for Apache Kvrocks, a Redis-compatible database. It aims to combine performance with transaction guarantees and complex query support, utilizing an intermediate language for consistency. Future plans include expanding field types and enhancing transaction guarantees.
Open-Source Perplexity – Omniplex
The Omniplex open-source project on GitHub focuses on core functionality, Plugins Development, and Multi-LLM Support. It utilizes TypeScript, React, Redux, Next.js, Firebase, and integrates with services like OpenAI and Firebase. Community contributions are welcomed.
Announcing Polars 1.0 (Blog Post)
Polars releases Python version 1.0 after 4 years, gaining popularity with 27.5K GitHub stars and 7M monthly downloads. Plans include improving performance, GPU acceleration, Polars Cloud, and new features.