Launch HN: Midship (YC S24) – Turn PDFs and Images into usable data
Midship, founded by Max, Kieran, and Aahel, specializes in extracting data from unstructured documents using OCR and language models, catering to both non-technical users and developers with subscription and API pricing.
Midship, founded by Max, Kieran, and Aahel, specializes in extracting data from unstructured documents such as PDFs and images. Initially, the team aimed to create a natural language workflow builder as an alternative to platforms like Zapier. However, they pivoted to focus on document extraction after observing significant user interest in this feature. Many businesses still rely on PDFs and images for data, and while OCR technology exists, it often fails to provide clean, structured data. Midship addresses this issue by combining OCR with language models, enabling the extraction of specific fields and tables while correcting OCR errors and understanding context. The platform caters to two main user groups: non-technical users who utilize the web app for data extraction and developers who access the extraction API. Midship offers a subscription model for the web app and volume-based pricing for the API. The founders are eager to receive feedback from the community on their product.
- Midship focuses on extracting data from unstructured documents like PDFs and images.
- The platform combines OCR with language models to improve data extraction accuracy.
- It serves both non-technical users through a web app and developers via an API.
- The company offers a subscription model for the web app and volume-based pricing for the API.
- Feedback from the community is welcomed as the founders continue to develop their product.
Related
Show HN: Midday – Run your business smarter (open-source)
Midday is a versatile tool for freelancers and small businesses, offering financial monitoring, time-tracking, file storage, and invoicing. With 3700+ users, it features automated receipt mapping, bank integrations, live tracking, and an invoice matching inbox. Users appreciate its open-source nature and tailored financial insights.
Fathom AI Notetaker (YC W21) Is Hiring a Head of Data (Remote/US)
Fathom, an AI meeting assistant startup, is hiring a Head of Data to enhance its strategy and drive growth, requiring technical skills and experience in leading data teams.
Show HN: AutoDocument – Multi-Source Document Generation
AutoDocument is an open-source tool for automating document processes, featuring advanced templating, multi-step workflows, and support for various file storage options, receiving positive user feedback for its capabilities.
Launch HN: Roe AI (YC W24) – AI-powered data warehouse to query multimodal data
Roe AI is developing a query engine that allows SQL queries on unstructured data using LLMs, simplifying analysis for teams and offering a free trial with AI credits.
Show HN: I built a React Native boilerplate to ship mobile apps faster
ExpoShip is a React Native boilerplate that simplifies app development with features like user authentication and payment integration, catering to both beginners and experienced developers, and offering promotional discounts.
I want to create something that can paginate through 12 pages of html, perform clicks, download pdf fund factsheet, extract data from this factsheet into excel or CSV. Can this help? What's the best way to deal with the initial task of automating webpage interactions systematically?
Tried the examples - they seem tailored for specific document types. I have two questions around that: (a) is their a "best-effort" extraction you can perform or plan to support if you don't know the document type? (b) do you plan to support extraction from academic papers, i.e., potentially multi-column, with images, tables that are either single column or span two columns, equations, etc.?
Curious what are the biggest complain from your users? Are they willing to manually auditing the numbers in the table, make sure the output is 1. accurate. 2. formatted in the table they expected.
I’m curious to hear more about your pivot from AI workflow builder to document parsing. I can see correlations there, but that original idea seems like a much larger opportunity than parsing PDFs to tables in what is an already very crowded space. What verticals did you find have this problem specifically that gave you enough conviction to pivot?
Can you do this with emails?
- https://www.ycombinator.com/companies/tableflow
- https://www.ycombinator.com/companies/reducto
- https://www.ycombinator.com/companies/mindee
- https://www.ycombinator.com/companies/omniai
- https://www.ycombinator.com/companies/trellis
At the same time, accurate document extraction is becoming a commodity with powerful VLMs. Are you planning to focus on a specific industry, or how do you plan to differentiate?
Related
Show HN: Midday – Run your business smarter (open-source)
Midday is a versatile tool for freelancers and small businesses, offering financial monitoring, time-tracking, file storage, and invoicing. With 3700+ users, it features automated receipt mapping, bank integrations, live tracking, and an invoice matching inbox. Users appreciate its open-source nature and tailored financial insights.
Fathom AI Notetaker (YC W21) Is Hiring a Head of Data (Remote/US)
Fathom, an AI meeting assistant startup, is hiring a Head of Data to enhance its strategy and drive growth, requiring technical skills and experience in leading data teams.
Show HN: AutoDocument – Multi-Source Document Generation
AutoDocument is an open-source tool for automating document processes, featuring advanced templating, multi-step workflows, and support for various file storage options, receiving positive user feedback for its capabilities.
Launch HN: Roe AI (YC W24) – AI-powered data warehouse to query multimodal data
Roe AI is developing a query engine that allows SQL queries on unstructured data using LLMs, simplifying analysis for teams and offering a free trial with AI credits.
Show HN: I built a React Native boilerplate to ship mobile apps faster
ExpoShip is a React Native boilerplate that simplifies app development with features like user authentication and payment integration, catering to both beginners and experienced developers, and offering promotional discounts.