thecrimson.com
robots.txt

Robots Exclusion Standard data for thecrimson.com

Resource Scan

Scan Details

Site Domain thecrimson.com
Base Domain thecrimson.com
Scan Status Ok
Last Scan 2024-11-05T06:31:44+00:00
Next Scan 2024-11-12T06:31:44+00:00

Last Scan

Scanned 2024-11-05T06:31:44+00:00
URL https://thecrimson.com/robots.txt
Redirect https://www.thecrimson.com/robots.txt
Redirect Domain www.thecrimson.com
Redirect Base thecrimson.com
Domain IPs 3.212.82.151, 3.220.119.130
Redirect IPs 3.165.102.123, 3.165.102.129, 3.165.102.58, 3.165.102.9
Response IP 3.165.102.9
Found Yes
Hash 86d88d40bc2c427d29729f0db339f3a974e7f57b3adf7398134ebdda34a27869
SimHash ec2a5335c393

Groups

hul-wax

Rule Path
Disallow

slurp

Rule Path
Disallow

Other Records

Field Value
crawl-delay 5

yahoo! slurp

Rule Path
Disallow

Other Records

Field Value
crawl-delay 5
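
Under the Robots Exclusion Standard, a Disallow rule with an empty path blocks nothing, so each group above leaves the whole site open to its user agent, and the two slurp groups additionally request a 5-second crawl delay. The sketch below shows one way to check this with Python's standard urllib.robotparser module. The robots.txt text in it is an assumption: it is reconstructed from the groups listed in this scan and is not the raw file, so the order of directives and any comments in the real file may differ. The names RECONSTRUCTED_ROBOTS_TXT and build_parser are hypothetical, introduced only for this illustration.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: this text is rebuilt from the groups reported by the scan.
# It is not a copy of the file served at www.thecrimson.com/robots.txt.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: hul-wax
Disallow:

User-agent: slurp
Disallow:
Crawl-delay: 5

User-agent: yahoo! slurp
Disallow:
Crawl-delay: 5
"""


def build_parser(text=RECONSTRUCTED_ROBOTS_TXT):
    """Parse robots.txt text from memory, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser()
    url = "https://www.thecrimson.com/"
    for agent in ("hul-wax", "slurp", "yahoo! slurp"):
        # can_fetch reports whether the agent may request the URL;
        # crawl_delay returns the requested delay in seconds, or None.
        print(agent, parser.can_fetch(agent, url), parser.crawl_delay(agent))
```

Note that urllib.robotparser matches user agents by case-insensitive substring, so an agent named "yahoo! slurp" is already covered by the "slurp" group; other parsers may resolve the two groups differently.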