dirtrider.com
robots.txt

Robots Exclusion Standard data for dirtrider.com

Resource Scan

Scan Details

Site Domain dirtrider.com
Base Domain dirtrider.com
Scan Status Ok
Last Scan 2024-09-27T03:42:01+00:00
Next Scan 2024-10-04T03:42:01+00:00

Last Scan

Scanned 2024-09-27T03:42:01+00:00
URL https://dirtrider.com/robots.txt
Redirect https://www.dirtrider.com:443/robots.txt
Redirect Domain www.dirtrider.com
Redirect Base dirtrider.com
Domain IPs 15.197.174.213, 3.33.166.34
Redirect IPs 23.49.60.58, 23.49.60.65, 2600:1413:b000:13::b857:c18e, 2600:1413:b000:13::b857:c1a2
Response IP 23.52.171.130
Found Yes
Hash b9bf6799573b0357a3e14b69c25654210022f1726eaff623ac070796c40b9840
SimHash 8c06da702503

Groups

gigabot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

nutch

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

asterias

Rule Path
Disallow /

*

Rule Path
Disallow /au/
Disallow /ca/
Disallow /fr/
Disallow /ca/
Disallow /fr/
Disallow /de/
Disallow /in/
Disallow /it/
Disallow /jp/
Disallow /mx/
Disallow /es/
Disallow /uk/

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.dirtrider.com/arcio/sitemap-index/index/
sitemap https://www.dirtrider.com/arcio/fronts-sitemap/
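
The groups and records listed above can be checked against a crawler's behavior with Python's standard urllib.robotparser. The following is a minimal sketch, not the site's actual file: the robots.txt text is rebuilt from this listing, so its layout and ordering are assumptions, as is the placement of crawl-delay inside the * group. The user agent "ExampleBot" and the helper is_allowed are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Values copied from the scan listing; duplicates in the listing are collapsed.
BLOCKED_BOTS = ["gigabot", "scrubby", "nutch", "baiduspider",
                "naverbot", "yeti", "asterias"]
REGION_PATHS = ["/au/", "/ca/", "/fr/", "/de/", "/in/",
                "/it/", "/jp/", "/mx/", "/es/", "/uk/"]
SITEMAPS = [
    "https://www.dirtrider.com/arcio/sitemap-index/index/",
    "https://www.dirtrider.com/arcio/fronts-sitemap/",
]

# Rebuild an approximate robots.txt (assumed layout, not the live file).
lines = []
for bot in BLOCKED_BOTS:
    lines += [f"User-agent: {bot}", "Disallow: /", ""]
lines.append("User-agent: *")
lines.append("Crawl-delay: 10")  # assumed to belong to the * group
lines += [f"Disallow: {path}" for path in REGION_PATHS]
lines.append("")
lines += [f"Sitemap: {url}" for url in SITEMAPS]

parser = RobotFileParser()
parser.parse(lines)


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if the reconstructed rules let user_agent fetch path."""
    return parser.can_fetch(user_agent, "https://www.dirtrider.com" + path)


print(is_allowed("baiduspider", "/"))         # False: whole site disallowed
print(is_allowed("ExampleBot", "/uk/page"))   # False: regional path disallowed
print(is_allowed("ExampleBot", "/news/"))     # True: no matching rule
print(parser.crawl_delay("ExampleBot"))       # 10
print(parser.site_maps())                     # the two sitemap URLs (Python 3.8+)
```

To test against the live file instead, call parser.set_url("https://www.dirtrider.com/robots.txt") followed by parser.read(); results may differ from this sketch if the file has changed since the scan.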

Comments

  • Disallow the following spiders