Is this like Google search but better and European, or am I misreading?
In a nutshell, this is what I understand, too. It may take some time until it gets fully competitive, but it could soon become a better alternative to gatekeepers like Google, imho.
Addition from a brief article I just found:
The EU’s Open Web Index Project: Another Step Toward Digital Independence
The Open Web Index (OWI) is an open-source initiative under the European Union’s Horizon Programme, aimed at democratizing web-search technologies and strengthening Europe’s digital sovereignty. The project will launch in June 2025, providing a common web index accessible to all and decoupling the indexing infrastructure from the search services that use it. In doing so, the OWI offers not only technical innovations but also a paradigm shift in the global search market—today, a single player (Google) holds over ninety percent of the market share and determines access to online information.
The project’s core idea is to make web crawling, metadata enrichment, and indexing a shared European resource. Development takes place in large data centres that process terabytes of raw data each day and publish the entire index as open data. All software components are open-source, and the CIFF format ensures that systems based on Lucene, Solr, or Terrier can connect to the OWI seamlessly. Thus, with minimal effort, researchers and developers can create vertical search engines that rank results according to specific criteria such as sustainability or privacy priorities [...]
Hm, sounds more like just a web scrape to me, i.e. someone else will need to build a search engine on top of this.
It’s an index, not a web scrape, though if you want to think about it that way then go for it. Indexes back all search engines.
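To make the scrape-vs-index distinction concrete, here is a minimal sketch of an inverted index in Python. It is only a toy illustration, not how the OWI is actually built: the page URLs and texts are made up, and `build_inverted_index` and `search` are hypothetical helper names. A scrape is just the raw pages; the index is the term-to-page lookup built from them.

```python
from collections import defaultdict

# Hypothetical scraped pages: URL -> raw text (made-up data for illustration).
pages = {
    "https://example.org/a": "open web index for europe",
    "https://example.org/b": "search engines need an index",
    "https://example.org/c": "raw web scrape without structure",
}


def build_inverted_index(docs):
    """Map each term to the set of URLs whose text contains it."""
    index = defaultdict(set)
    for url, text in docs.items():
        for term in text.lower().split():
            index[term].add(url)
    return index


def search(index, query):
    """Return the URLs containing every term of the query (boolean AND)."""
    terms = query.lower().split()
    if not terms:
        return set()
    results = set(index.get(terms[0], set()))
    for term in terms[1:]:
        results &= index.get(term, set())
    return results


index = build_inverted_index(pages)
print(sorted(search(index, "web index")))
```

With only the scrape you would have to re-read every page for every query; with the index, a query is a couple of set lookups. Ranking and a user interface would still have to come from whoever builds a search engine on top.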
Fingers crossed
Only few search engines index the Web at scale. Third parties who want to develop downstream applications based on web search fully depend on the terms and conditions of the few vendors. The public availability of the large-scale Common Crawl does not alleviate the situation, as it is often cheaper to crawl and index only a smaller collection focused on a downstream application scenario than to build and maintain an index for a general collection the size of the Common Crawl. Our goal is to improve this situation by developing the Open Web Index.
The Open Web Index is a publicly funded basic infrastructure from which downstream applications will be able to select and compile custom indexes in a simple and transparent way. Our goal is to establish the Open Web Index along with associated data products as a new open web information intermediary.
https://downloads.webis.de/publications/papers/hendriksen_2024.pdf
This paper seems to give a good, quick overview.
It looks to be the usual EU tech project. Doing more to achieve less in a desperate, hopeless attempt to make up for the stupidity and greed of European elites.
Given that the European Commission has launched the InvestAI initiative to mobilize €200 billion of investment in artificial intelligence, the Open Web Index comes with perfect timing.
But these dumbfucks cut the awesome NGI Zero grant for this, which funded many awesome open-source projects.