Why don't we have personalized search engines?
Users criticize search engines like Google for prioritizing ads over content, expressing a need for a unified tool to access diverse content types and better organize personal notes across applications.
The current state of search engines, particularly Google, is criticized for prioritizing advertisements over valuable content, leading to less effective search results. Users express a desire for a comprehensive tool that consolidates various types of content, including previously read articles, curated recommendations from trusted sources, purchased books, and personal notes stored across multiple applications. The need for a more efficient and user-friendly search experience is evident, as existing solutions do not adequately address these requirements.
- Users find current search engines inadequate due to ad-centric results.
- There is a demand for a unified tool to search diverse content types.
- Users want to access curated content from trusted sources easily.
- The need for better organization of personal notes across different apps is highlighted.
Related
Google rejected me and now I'm building a search engine
The article recounts a rejection from Google during an interview, prompting the individual to create a non-profit, community-driven search engine emphasizing ethical values over profit, welcoming contributions for development.
'Google says I'm a dead physicist': is the biggest search engine broken?
Google faces scrutiny over search result accuracy and reliability, with concerns about incorrect information and cluttered interface. Despite dominance in the search market, criticisms persist regarding data privacy and search quality.
Breaking up Google would offer a chance to remodel the web
The antitrust ruling against Google reveals its illegal monopoly, hindering competition. Alternatives exist but struggle. A post-Google internet aims for user privacy and diverse, non-invasive services.
Digital Tools I Wish Existed (2019)
The article highlights challenges in managing digital content, emphasizing the need for effective tools for organization, retrieval, and integration, advocating for a more interconnected approach to personal data management.
Google's AI search gives sites dire choice: share data or die
Google's AI Overviews diminish traffic to original content, forcing publishers to choose between sharing content or losing visibility. Antitrust scrutiny may prompt changes in Google's operations and data sharing practices.
(especially calendar events, which used to be fun to track but everyone seems to have given up on event listings).
It wouldn’t have to track me, or infer other nefarious dimensions of my online habits, just target the things I’m asking to be targeted for.
I’m guessing that the implicit data dimensions of current tech is aggregating so much additional data about everyone that the recommendations we end up getting aren’t that great.
None of Google, Netflix, or Amazon get me at all, and I keep shoveling my habits right into their gaping data maw.
I did write a script that does the downloading part. It looks at my browser history and downloads the text of every site going back years.
Ditto for decades worth of email. I want to see if I ask it for my nephew's birthday, will it figure it out?
Should be doable without much difficulty.
It's not that Google doesn't know technically how to give good results. It's really that Google is optimizing for profit, not for quality. In a system that makes it extremely difficult for anyone to compete (and whoever succeeded with that would presumably end up in the same situation and optimize for profit).
I never connected my other accounts, as I found the idea of a 3rd party having access to crawl and catalog them uncomfortable.
Neeva has since shutdown and was acquired by Snowflake.
What you’re mentioning, would likely require a company to have a very large monopoly for a very long time, where all a person’s digital media was controlled by one company. Google is close, but for book people paid for, that’s something that would fall more into Amazon’s territory. Apple also has a bookstore, so maybe it would work for people who are 100% in Apple’s ecosystem and never stray, and then only have friends with people also in Apple’s ecosystem (for the people you trust feature).
I don’t think we’ll every see enough benevolent cooperation between companies, without ulterior motives, to do something like this well without it also being a security nightmare.
- no_SEO: demote anything which employs 'SEO' so it appear below search results not guilty of this sin
- no_Blogspam: demote blogspam below the original articles the bloggers refer to
- no_Sales: demote anything which tries to sell me something below results which do not. This is a tricky one to implement because not every site offering to sell something should be caught, e.g. a site explaining how to repair a flux capacitor which links to a source for these ubiquitous parts but mostly contains instructions how to install and tune the part is fine.
- no_GPT: demote anything recognised as being generated through 'AI'.
- $filter: an option to create custom filters
Depending on the reason for searching the 'net I'd have most of these options enabled but every now and then I'd switch one off, e.g. no_SEO/no_Sale when looking for something to buy.
I'm running an instance of SearxNG and hardly ever interface directly with individual search engines so I mostly avoid the 'personalisation' problem but I do not yet have access to filtering options like the ones I mentioned.
To been able to avoid commercial search engines we have only an option: public funding public universities who cure national infra (something already existing, but bigger) and a public indexing project with a national plan for a homeserver per connected home (much like actual ISPs 'router', only pure FLOSS handled by the user or using anyway public code) witch in the other functions also index a small part of the web in an open project like YaCy. Same thing for VoIP comms.
WE DAMN NEED institutionalized FLOSS.
This exists, it's here, and no one uses it for anything except serving you better ads.
And among 95% of normal users there's no demand for it because what most people do is google restaurants, cinemas, dancing videos on TikTok or they just add "reddit" to their search for anything more complicated. Most people haven't bought any reading material on the internet and don't have notes.
Thus a personalized search engine could double as a forum moderator.
And you could share search engines. Get a copy of the search engine of somebody that you admire/trust and merge it with your own. Thus your search engine could learn from others what's good and bad.
You could have a family search engine, passed down through the generations.
When people started talking about LLMs and AI I was hoping for something that would monitor news and websites and find things that I was interested in. Something that would go beyond just keyword searches and also be able to pull in stories on radio and tv.
Basically, it smells like a solved problem wiht open source tools built for Enterprise. I thought then and think now it could be scaled down to a hardware appliance that sits on a home network. But I am probably wrong about all of it. Good luck.
Related
Google rejected me and now I'm building a search engine
The article recounts a rejection from Google during an interview, prompting the individual to create a non-profit, community-driven search engine emphasizing ethical values over profit, welcoming contributions for development.
'Google says I'm a dead physicist': is the biggest search engine broken?
Google faces scrutiny over search result accuracy and reliability, with concerns about incorrect information and cluttered interface. Despite dominance in the search market, criticisms persist regarding data privacy and search quality.
Breaking up Google would offer a chance to remodel the web
The antitrust ruling against Google reveals its illegal monopoly, hindering competition. Alternatives exist but struggle. A post-Google internet aims for user privacy and diverse, non-invasive services.
Digital Tools I Wish Existed (2019)
The article highlights challenges in managing digital content, emphasizing the need for effective tools for organization, retrieval, and integration, advocating for a more interconnected approach to personal data management.
Google's AI search gives sites dire choice: share data or die
Google's AI Overviews diminish traffic to original content, forcing publishers to choose between sharing content or losing visibility. Antitrust scrutiny may prompt changes in Google's operations and data sharing practices.