classicarlington.com
robots.txt
Robots Exclusion Standard data for classicarlington.com
Resource Scan
Scan Details
Site Domain | classicarlington.com |
Base Domain | classicarlington.com |
Scan Status | Ok |
Last Scan | 2024-10-07T03:13:52+00:00 |
Next Scan | 2024-11-06T03:13:52+00:00 |
Last Scan
Scanned | 2024-10-07T03:13:52+00:00 |
URL | https://classicarlington.com/robots.txt |
Redirect | https://www.classicarlington.com/robots.txt |
Redirect Domain | www.classicarlington.com |
Redirect Base | classicarlington.com |
Domain IPs | 216.241.213.55 |
Redirect IPs | 13.33.88.104, 13.33.88.41, 13.33.88.88, 13.33.88.92 |
Response IP | 13.33.88.41 |
Found | Yes |
Hash | b3253f9370a2b4d8de3ad30cae884133decf9ae32b3d6d7d7b6f593c87a7f655 |
SimHash | 5b155090c6b0 |
Groups
googlebot
googlebot-mobile
adsbot-google
Rule | Path |
---|---|
Disallow | /siteMap |
Disallow | /*.do |
Disallow | /*.ajax |
Disallow | /*.js |
Disallow | /*.css |
Disallow | /f_* |
bingbot
applebot
msnbot
adidxbot
motominerbot
mj12bot
rogerbot
ravencrawler
twitterbot
slurp
duckduckbot
semrushbot
teoma
archive.org_bot
Rule | Path |
---|---|
Disallow | /siteMap |
Disallow | /*.do |
Disallow | /*.ajax |
Disallow | /*.js |
Disallow | /*.css |
Disallow | /f_* |
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
*
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.classicarlington.com/sitemap.xml |
sitemap | https://www.idostream.com/sitemaps/3865/sitemap1/video-sitemap.xml |