archives.bulbagarden.net
robots.txt

Robots Exclusion Standard data for archives.bulbagarden.net

Resource Scan

Scan Details

Site Domain archives.bulbagarden.net
Base Domain bulbagarden.net
Scan Status Ok
Last Scan2024-06-02T12:06:19+00:00
Next Scan 2024-07-02T12:06:19+00:00

Last Scan

Scanned2024-06-02T12:06:19+00:00
URL https://archives.bulbagarden.net/robots.txt
Domain IPs 104.21.233.239, 104.21.233.240, 2606:4700:3038::6815:e9ef, 2606:4700:3038::6815:e9f0
Response IP 104.21.233.239
Found Yes
Hash 4b33bfca063b27c4d1fa83f8b60b3ca24085de69d1144917d122b490562649cd
SimHash 3134d850fbf7

Groups

mediapartners-google*

Rule Path
Disallow

*

Rule Path
Disallow /wiki/Special%3ASearch
Disallow /wiki/Special%3ASearch
Disallow /wiki/Special%3ARandompage
Disallow /wiki/Special%3ARandompage

Other Records

Field Value
crawl-delay 1

Comments

  • advertising-related bots: