cpasitesolutions.com
robots.txt
Robots Exclusion Standard data for cpasitesolutions.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cpasitesolutions.com |
Base Domain | cpasitesolutions.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-10-08T13:10:42+00:00 |
Next Scan | 2024-10-22T13:10:42+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-08-31T12:41:47+00:00 |
URL | https://cpasitesolutions.com/robots.txt |
Redirect | https://www.cpasitesolutions.com/file_distribution/robots.php |
Redirect Domain | www.cpasitesolutions.com |
Redirect Base | cpasitesolutions.com |
Domain IPs | 54.148.47.112 |
Redirect IPs | 54.148.47.112 |
Response IP | 54.148.47.112 |
Found | Yes |
Hash | 26f59d5be3960a985a6efe48908446217825b690ed0662dc9b93d3eac90ea03f |
SimHash | 2f05d051c575 |
Groups
googlebot
adsbot-google
googlebot-image
mediapartners-google
adsbot
slurp
yandex
twitterbot
baiduspider
bingbot
msnbot
yahoo
ia_archiver
rogerbot
ravencrawler
amazonbot
facebookexternalhit
facebookexternalhit/1.1
facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)
linkedinbot
linkedinbot/1.0
linkedinbot/1.0 (compatible; mozilla/5.0; apache-httpclient +http://www.linkedin.com)
Rule | Path |
---|---|
Disallow | /~* |
Disallow | /*?* |
Disallow | |
Allow | /*?utm_* |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
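The rules for this group interact: `/*?*` blocks any URL with a query string, while the longer `/*?utm_*` pattern re-allows URLs whose query string starts with `utm_`. The following is a minimal sketch of how a crawler might evaluate them, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scanned site or any library; real crawlers may interpret the file differently.

```python
import re

# Rules as listed in the report for the named crawler group.
RULES = [
    ("disallow", "/~*"),
    ("disallow", "/*?*"),
    ("disallow", ""),        # empty path: matches nothing, so it is skipped
    ("allow", "/*?utm_*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len, allowed = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if not pattern:
            continue
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/services", "/~user/page", "/page?id=3", "/page?utm_source=x"]:
        print(p, is_allowed(p))
```

Under these assumptions, a URL such as `/page?id=3&utm_source=x` stays blocked, because `/*?utm_*` only matches when `utm_` immediately follows the `?`. The `crawl-delay` value is not modeled here; it is a non-standard field that some crawlers treat as a number of seconds to wait between requests.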
*
Product | Comment |
---|---|
* | Disallow everyone else |
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.cpasitesolutions.com/sitemap.xml |
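The report shows two groups: one naming specific crawlers and a catch-all `*` group that disallows everything. A rough sketch of how a crawler could decide which group applies to it is given below, assuming a case-insensitive exact match against the listed tokens with fallback to `*`. The names `NAMED_GROUP` and `group_for` are hypothetical, and `examplebot` is a made-up token used only for illustration.

```python
# User-agent tokens listed for the first group in the report (lowercased).
NAMED_GROUP = {
    "googlebot", "adsbot-google", "googlebot-image", "mediapartners-google",
    "adsbot", "slurp", "yandex", "twitterbot", "baiduspider", "bingbot",
    "msnbot", "yahoo", "ia_archiver", "rogerbot", "ravencrawler", "amazonbot",
    "facebookexternalhit", "facebookexternalhit/1.1",
    "facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)",
    "linkedinbot", "linkedinbot/1.0",
    "linkedinbot/1.0 (compatible; mozilla/5.0; apache-httpclient +http://www.linkedin.com)",
}


def group_for(user_agent_token):
    """Return 'named' if the token is listed explicitly, else the '*' group."""
    return "named" if user_agent_token.lower() in NAMED_GROUP else "*"


if __name__ == "__main__":
    print(group_for("Googlebot"))   # listed token, so the named group applies
    print(group_for("examplebot"))  # hypothetical token, falls back to '*'
```

A crawler that lands in the `*` group is subject to `Disallow: /`, so it should not fetch any path on the site.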