使用集合让一切井井有条
根据您的偏好保存内容并对其进行分类。
Google 用户触发的抓取工具列表
用户触发的抓取工具由用户启动,以在 Google 产品中执行抓取功能。例如,Google 网站验证工具会根据用户的请求执行操作,或者 Google Cloud (GCP) 上托管的网站具有允许网站用户检索外部 RSS Feed 的功能。由于是用户请求的抓取,因此这些抓取工具通常会忽略 robots.txt 规则。Google 抓取工具的常规技术属性也适用于用户触发的抓取工具。
用户触发的抓取工具使用的 IP 范围会发布在 user-triggered-fetchers.json 和 user-triggered-fetchers-google.json 对象中。用户触发的抓取工具的反向 DNS 掩码(取决于抓取工具是 Google 还是用户拥有)分别与 ***-***-***-***.gae.googleusercontent.com
或 google-proxy-***-***-***-***.google.com
匹配。
以下列表显示了用户触发的抓取工具、它们在 HTTP 请求中显示的用户代理字符串以及它们关联的产品。此列表并非详尽无遗,仅涵盖更有可能出现在日志文件中且我们收到过相关问题的请求者。
-
HTTP 请求中的用户代理 |
FeedFetcher-Google; (+http://www.google.com/feedfetcher.html)
|
相关产品 |
Feedfetcher 用于为 Google 新闻和 PubSubHubbub 抓取 RSS 或 Atom Feed。
|
Google 发布商中心
-
HTTP 请求中的用户代理 |
GoogleProducer; (+https://developers.google.com/search/docs/crawling-indexing/google-producer)
|
相关产品 |
Google 发布商中心会抓取并处理发布商明确提供的 Feed,以便在 Google 新闻着陆页中使用。
|
-
HTTP 请求中的用户代理 |
移动版代理 |
Mozilla/5.0 (Linux; Android 10; K) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/138.0.0.0 Mobile Safari/537.36 (compatible; Google-Read-Aloud; +https://support.google.com/webmasters/answer/1061943)
|
桌面版代理 |
Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/137.0.0.0 Safari/537.36 (compatible; Google-Read-Aloud; +https://support.google.com/webmasters/answer/1061943)
|
曾用代理(已弃用) |
google-speakr
|
|
相关产品 |
根据用户请求,Google Read Aloud 会使用文字转语音 (TTS) 技术来抓取并朗读网页内容。
|
-
HTTP 请求中的用户代理 |
Mozilla/5.0 (compatible; Google-Site-Verification/1.0)
|
相关产品 |
Google 网站验证工具会抓取 Search Console 验证令牌。
|
如未另行说明,那么本页面中的内容已根据知识共享署名 4.0 许可获得了许可,并且代码示例已根据 Apache 2.0 许可获得了许可。有关详情,请参阅 Google 开发者网站政策。Java 是 Oracle 和/或其关联公司的注册商标。
最后更新时间 (UTC):2025-08-04。
[null,null,["最后更新时间 (UTC):2025-08-04。"],[[["\u003cp\u003eUser-triggered fetchers are initiated by users within Google products and generally disregard robots.txt rules.\u003c/p\u003e\n"],["\u003cp\u003eThese fetchers utilize specific IP ranges and reverse DNS masks for identification, detailed in provided JSON files.\u003c/p\u003e\n"],["\u003cp\u003eGoogle products like Feedfetcher, Google Publisher Center, Google Read Aloud, and Google Site Verifier utilize user-triggered fetching for various purposes.\u003c/p\u003e\n"],["\u003cp\u003eA table lists these fetchers with their user agent strings and associated products for easier identification in log files, although user agent strings can be spoofed.\u003c/p\u003e\n"],["\u003cp\u003eVerifying the visitor's identity as a Google crawler is crucial and can be achieved using the provided verification guide.\u003c/p\u003e\n"]]],["User-triggered fetchers, initiated by user actions within Google products, ignore `robots.txt` rules. These fetchers' IP ranges are in `user-triggered-fetchers.json` and `user-triggered-fetchers-google.json`, with reverse DNS masks either `*.gae.googleusercontent.com` or `google-proxy-*.google.com`. Key fetchers include: Feedfetcher for RSS/Atom feeds; Google Publisher Center for publisher-supplied feeds; Google Read Aloud for text-to-speech; and Google Site Verifier for Search Console verification tokens. Each has distinct user-agent strings.\n"],null,["# Google User-Triggered Fetchers | Google Search Central\n\nList of Google user-triggered fetchers\n======================================\n\n\nUser-triggered fetchers are initiated by users to perform a fetching function within a Google\nproduct. For example,\n[Google Site Verifier](https://support.google.com/webmasters/answer/9008080)\nacts on a user's request, or a site hosted on Google Cloud (GCP) has a feature that allows the\nsite's users to retrieve an external RSS feed. Because the fetch was requested by a user, these\nfetchers generally ignore robots.txt rules. The general\n[technical properties](/search/docs/crawling-indexing/overview-google-crawlers#crawl-technical-props)\nof Google's crawlers also apply to the user-triggered fetchers.\n\n\nThe IP ranges the user-triggered fetchers use are\npublished in the\n[user-triggered-fetchers.json](/static/search/apis/ipranges/user-triggered-fetchers.json) and\n[user-triggered-fetchers-google.json](/static/search/apis/ipranges/user-triggered-fetchers-google.json)\nobjects. The user-triggered fetchers' reverse DNS mask, depending on whether the fetcher is Google\nor user owned, matches `***-***-***-***.gae.googleusercontent.com` or\n`google-proxy-***-***-***-***.google.com` respectively.\n\n\nThe following list shows the user-triggered fetchers, their user agent strings as they appear in\nthe HTTP requests, and the products they are associated with. The list is not exhaustive, it only\ncovers the requestors that are more likely to show up in log files and that we've received\nquestions about.\n| **Caution** : The user agent string can be spoofed. [Learn how to verify if a visitor is a Google crawler](/search/docs/crawling-indexing/verifying-googlebot).\n\n[Feedfetcher](/search/docs/crawling-indexing/feedfetcher)\n---------------------------------------------------------\n\n:\n\n |---------------------------------|--------------------------------------------------------------------------------------|\n | **User-Agent in HTTP requests** | ``` FeedFetcher-Google; (+http://www.google.com/feedfetcher.html) ``` |\n | **Associated products** | Feedfetcher is used for crawling RSS or Atom feeds for Google News and PubSubHubbub. |\n\n\nGoogle Publisher Center\n-----------------------\n\n:\n\n |---------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|\n | **User-Agent in HTTP requests** | ``` GoogleProducer; (+https://developers.google.com/search/docs/crawling-indexing/google-producer) ``` |\n | **Associated products** | Google Publisher Center fetches and processes [feeds that publishers explicitly supplied](https://support.google.com/news/publisher-center/answer/9545414) for use in Google News landing pages. |\n\n\n[Google Read Aloud](/search/docs/crawling-indexing/read-aloud-user-agent)\n-------------------------------------------------------------------------\n\n:\n\n |---------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|\n | **User-Agent in HTTP requests** | |---------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Mobile agent | ``` Mozilla/5.0 (Linux; Android 10; K) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/138.0.0.0 Mobile Safari/537.36 (compatible; Google-Read-Aloud; +https://support.google.com/webmasters/answer/1061943) ``` | | Desktop agent | ``` Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/137.0.0.0 Safari/537.36 (compatible; Google-Read-Aloud; +https://support.google.com/webmasters/answer/1061943) ``` | | Former agent (deprecated) | `google-speakr` | |\n | **Associated products** | Upon user request, Google Read Aloud [fetches and reads out web pages using text-to-speech (TTS)](/search/docs/crawling-indexing/read-aloud-user-agent). |\n\n\n[Google Site Verifier](https://support.google.com/webmasters/answer/9008080)\n----------------------------------------------------------------------------\n\n:\n\n |---------------------------------|------------------------------------------------------------------|\n | **User-Agent in HTTP requests** | ``` Mozilla/5.0 (compatible; Google-Site-Verification/1.0) ``` |\n | **Associated products** | Google Site Verifier fetches Search Console verification tokens. |"]]