getpocket.cdn.mozilla.net
robots.txt

Robots Exclusion Standard data for getpocket.cdn.mozilla.net

Resource Scan

Scan Details

Site Domain getpocket.cdn.mozilla.net
Base Domain mozilla.net
Scan Status Ok
Last Scan2024-05-22T09:52:00+00:00
Next Scan 2024-06-05T09:52:00+00:00

Last Scan

Scanned2024-05-22T09:52:00+00:00
URL https://getpocket.cdn.mozilla.net/robots.txt
Domain IPs 2600:1901:0:524c::, 34.120.5.221
Response IP 34.120.5.221
Found Yes
Hash 589dd1d574e59658e881f277336c7a4f3a0fb53b80375ac9cd1af3c85f59a8a2
SimHash e4105619c3b7

Groups

*

Rule Path
Disallow /v2/*

*

Rule Path
Disallow /v3/*

*

Rule Path
Disallow /create*

*

Rule Path
Disallow /mini_login*

*

Rule Path
Disallow /button*

*

Rule Path
Disallow /addemail*

*

Rule Path
Disallow /redirect*

*

Rule Path
Disallow /email_unsubscribe*

*

Rule Path
Disallow /firefox/new_tab_learn_more*

*

Rule Path
Disallow /s/*

*

Rule Path
Disallow /read/*

*

Rule Path
Disallow /edit*
Disallow /save*

Other Records

Field Value
crawl-delay 2

Comments

  • Crawl-delay is non-standard and is interpreted differently between different
  • search engines. 2 *should* be a low enough value to not disrupt our current SEO