cpasitesolutions.com
robots.txt

Robots Exclusion Standard data for cpasitesolutions.com

Resource Scan

Scan Details

Site Domain cpasitesolutions.com
Base Domain cpasitesolutions.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-10-08T13:10:42+00:00
Next Scan 2024-10-22T13:10:42+00:00

Last Successful Scan

Scanned 2024-08-31T12:41:47+00:00
URL https://cpasitesolutions.com/robots.txt
Redirect https://www.cpasitesolutions.com/file_distribution/robots.php
Redirect Domain www.cpasitesolutions.com
Redirect Base cpasitesolutions.com
Domain IPs 54.148.47.112
Redirect IPs 54.148.47.112
Response IP 54.148.47.112
Found Yes
Hash 26f59d5be3960a985a6efe48908446217825b690ed0662dc9b93d3eac90ea03f
SimHash 2f05d051c575

Groups

googlebot
adsbot-google
googlebot-image
mediapartners-google
adsbot
slurp
yandex
twitterbot
baiduspider
bingbot
msnbot
yahoo
ia_archiver
rogerbot
ravencrawler
amazonbot
facebookexternalhit
facebookexternalhit/1.1
facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)
linkedinbot
linkedinbot/1.0
linkedinbot/1.0 (compatible; mozilla/5.0; apache-httpclient +http://www.linkedin.com)

Rule Path
Disallow /~*
Disallow /*?*
Disallow
Allow /*?utm_*

Other Records

Field Value
crawl-delay 5
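
The Allow rule for /*?utm_* overlaps the broader Disallow for /*?*, so the outcome depends on how a crawler resolves conflicts. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific pattern wins, and Allow wins a tie); individual crawlers may behave differently. The names RULES, pattern_to_regex, and is_allowed are illustrative and do not come from the scan.

```python
import re

# Rules transcribed from the named-crawler group in the scan above.
RULES = [
    ("disallow", "/~*"),
    ("disallow", "/*?*"),
    ("disallow", ""),         # empty path: matches nothing, so it is skipped
    ("allow", "/*?utm_*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best_len, verdict = -1, True
    for kind, pattern in rules:
        if not pattern:
            continue
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, verdict = len(pattern), allow
    return verdict
```

Under these assumptions, is_allowed("/page?utm_source=x") is True because the Allow pattern is longer than /*?*, while is_allowed("/page?id=1") and is_allowed("/~user") are both False.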

*

Product Comment
* Disallow everyone else
Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cpasitesolutions.com/sitemap.xml
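
The file therefore has two groups: one for the crawlers listed by name and a catch-all * group that disallows everything. A minimal sketch of how a crawler might pick its group follows, assuming a case-insensitive exact match on the user-agent strings listed in the scan, with * as the fallback. NAMED_AGENTS and select_group are hypothetical names, and real crawlers may match on the product token only.

```python
# User-agent strings transcribed from the Groups list in the scan above.
NAMED_AGENTS = {
    "googlebot", "adsbot-google", "googlebot-image", "mediapartners-google",
    "adsbot", "slurp", "yandex", "twitterbot", "baiduspider", "bingbot",
    "msnbot", "yahoo", "ia_archiver", "rogerbot", "ravencrawler", "amazonbot",
    "facebookexternalhit", "facebookexternalhit/1.1",
    "facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)",
    "linkedinbot", "linkedinbot/1.0",
    "linkedinbot/1.0 (compatible; mozilla/5.0; apache-httpclient +http://www.linkedin.com)",
}


def select_group(user_agent):
    """Return 'named' if the agent is listed explicitly, otherwise '*'.

    Assumption: matching is a case-insensitive exact comparison.
    """
    return "named" if user_agent.strip().lower() in NAMED_AGENTS else "*"
```

For example, select_group("Googlebot") returns "named", so the wildcard rules and crawl-delay apply, while an unlisted agent such as "SomeOtherBot" falls through to "*" and is disallowed from /.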

Comments

  • Allow google, bing, msn, yahoo, amazon