6 days ago
All that to say, a search engine cannot be your sole source of information and discovery. Its strength is in helping you find specific things when you need them, but for a well-rounded information gathering experience, we all need to put more faith into other discovery methods, especially for the independent, secret web.
08 Sep 25
Here you will find all search engines which offer English-language results that the world has to offer, what type of search engine they are and where they get their organic results from.
In plain English, this service looks at which websites link to a particular target website, and then it ranks websites that are popular among those linking websites using a method commonly used in recommendation algorithms.
In technical jargon, it reinterprets the incident edges in the adjacency matrix as sparse high dimensional vector, and uses cosine similarity to find the nearest neighbors nodes within this feature-space.
This is a write-up about an experiment from a few months ago, in how to find websites that are similar to each other. Website similarity is useful for many things, including discovering new websites to crawl, as well as suggesting similar websites in the Marginalia Search random exploration mode.
Yet another independent search engine for the Small Web.
18 Aug 25
EUが支援する非営利でOpen Sourceな検索エンジン
Brave Browser開発元による検索エンジン。GAFAM外のアメリカ企業の検索エンジンとして唯一日本語に対応している
17 Aug 25
書影の利用確認ができることが特徴の本のためのdatabase
10 Aug 25
Lemmy Instance, Lemmy Communityの検索エンジン
08 Aug 25
07 Aug 25
https://www.yomiuri.co.jp/national/20250807-OYT1T50151/ 読売新聞の本件についての記事
そりゃ2005年にGoogle Newsモドキを訴訟[平成17(ネ)10049]した読売新聞がrobots.txtすら守らないAI企業を許せるわけがないよね
06 Aug 25
fedibird管理人によるMastodonの検索書式の早見表。もちろんfedibird独自の機能も詳解
05 Aug 25
04 Aug 25
“TollBitの調査では、サイトへのアクセス1回あたりのスクレイピング回数は、Perplexityが369回、Anthropicに至っては8692回にものぼる”
robots.txtを顧みないサービスをビジネスに組み込むとは、さすがネトランを発行していた企業ではある
clouldflareは大嫌いだけど、このrobots.txtのためのイナゴAIとの戦いはめっちゃ応援してる