September 8th, 2024

Creating a search engine for fun and because Google sucks

A software engineer is developing a free, fast, and privacy-focused search engine using Rust, sourcing results from DuckDuckGo and images from Google, while addressing challenges with pagination and HTML parsing.

Read original articleLink Icon
Creating a search engine for fun and because Google sucks

A software engineer is developing a personal search engine due to dissatisfaction with current options, particularly Google, which has become overloaded with ads and AI-generated content. The engineer aims to create a free, fast, and safe search engine using Rust, avoiding reliance on paid APIs and ensuring user privacy by not tracking searches. The initial plan involves scraping results from DuckDuckGo (DDG) and images from Google, leveraging DDG Lite for its simplicity. The engineer has encountered challenges with pagination and parsing HTML responses, leading to issues with retrieving additional search results. Despite these hurdles, the project is ongoing, with the engineer exploring alternative parsing methods and considering the use of Servo, a Rust-based browser engine. The search engine is currently in a rudimentary state, and the engineer is open to job opportunities in Rust development.

- The engineer is building a personal search engine to address dissatisfaction with existing options.

- The project focuses on being free, fast, and safe, prioritizing user privacy.

- Initial results will be sourced from DuckDuckGo and images from Google.

- Challenges include pagination issues and HTML parsing errors.

- The engineer is exploring alternative methods and is open to job opportunities in Rust development.

Link Icon 12 comments
By @jsnell - 4 months
Creating a thin, non-working wrapper around DuckDuckGo as a "hire me" ad, claiming it's a search engine and mentioning Google in the headline because it'll get more clicks.
By @ramigb - 4 months
So your "search engine" uses Google and DuckDuckGo to get results? and you are creating it because "Google sucks"? Am I missing something here?
By @TheFragenTaken - 4 months
The phonebook capabilities of pretty much all Google alternatives suck. Google is currently the only search engine that actually works for local queries (i.e. phone repair in <local town>).

Alternatives while probably fine in America, suck in the Nordics. I think people forget just how much search traffic happens in this category.

By @r_singh - 4 months
I've been using SearXNG[1] via Perplexica[2] and I couldn't be happier. It replaced Google and ChatGPT/Perplexity type search engines for me and it's the first tool I use for question answer type searches.

[1] - https://docs.searxng.org [2] - https://github.com/ItzCrazyKns/Perplexica

By @randomman1131 - 4 months
As someone who cares about their online searches actually being good, fast and private, I cannot recommend SearXNG more. https://github.com/searxng/searxng/

It's a metasearch engine that can query multiple search providers at once, including google, so you're not missing out on the good results you expect. Pick an instance at https://searx.space/ and tell your friends!

By @sneak - 4 months
I have been thinking about this. I’d like to index the blogosphere and some knowledge base sites and stuff and exclude most of the Top1000 websites.

I imagine I could do it on consumer hardware for less than $10-20k.

Perhaps common crawl has done much of the heavy lifting already and I just have an indexing task.

By @delanyoyoko - 4 months
https://vincents.dev/blog/creating-a-search-engine/?

I visited the page on mobile and I have to scroll horizontally back and forth before I can read?

That is not fun at all.

By @roschdal - 4 months
Google is a fine search engine.
By @sam29681749 - 4 months
This article doesn't scale to fit the window on Firefox on android.
By @nprateem - 4 months
Duckduckgo is now particularly riddled by AI spam. The top 5 summaries for lots of searches now begin with some variation of "In the fast-moving world of...". Utter shit.

It'd be so easy to filter I wonder why they/Microsoft don't bother. Oh, wait...