consolecrunch.org
robots.txt

Robots Exclusion Standard data for consolecrunch.org

Resource Scan

Scan Details

Site Domain consolecrunch.org
Base Domain consolecrunch.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-08-28T21:31:20+00:00
Next Scan 2024-11-26T21:31:20+00:00

Last Successful Scan

Scanned2023-10-11T21:28:33+00:00
URL https://consolecrunch.org/robots.txt
Domain IPs 104.21.54.221, 172.67.142.203, 2606:4700:3030::ac43:8ecb, 2606:4700:3035::6815:36dd
Response IP 104.21.54.221
Found Yes
Hash 0f7101a492f4a2d28eac40ee330aa058fdac899bd19c41c1841027244b13ef60
SimHash 2952a3375691

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /tmp/
Allow /amp

googlebot-mobile

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

msnbot

Rule Path
Allow /

yahoo pipes 1.0

Rule Path
Allow /

yahoo! slurp

Rule Path
Allow /

baiduspider

Rule Path
Allow /

baiduspider-news

Rule Path
Allow /

baiduspider-image

Rule Path
Allow /

yandexbot

Rule Path
Allow /

yandeximages

Rule Path
Allow /

yandexnews

Rule Path
Allow /

yandexwebmaster

Rule Path
Allow /

yandexpagechecker

Rule Path
Allow /

zyborg

Rule Path
Allow /

exabot

Rule Path
Allow /

facebot

Rule Path
Allow /

ia_archiver

Rule Path
Allow /

archive.org_bot

Rule Path
Allow /

architextspider

Rule Path
Allow /

feedfetcher-google

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://consolecrunch.org/sitemap.xml

Warnings

  • 2 invalid lines.