cmswire.com
robots.txt

Robots Exclusion Standard data for cmswire.com

Resource Scan

Scan Details

Site Domain cmswire.com
Base Domain cmswire.com
Scan Status Ok
Last Scan 2024-09-22T16:47:13+00:00
Next Scan 2024-09-29T16:47:13+00:00

Last Scan

Scanned 2024-09-22T16:47:13+00:00
URL https://cmswire.com/robots.txt
Redirect https://www.cmswire.com/robots.txt
Redirect Domain www.cmswire.com
Redirect Base cmswire.com
Domain IPs 104.20.92.185, 104.20.93.185, 2606:4700:10::6814:5cb9, 2606:4700:10::6814:5db9
Redirect IPs 104.20.92.185, 104.20.93.185, 2606:4700:10::6814:5cb9, 2606:4700:10::6814:5db9
Response IP 104.20.92.185
Found Yes
Hash 532c54e78cc4577c31554d3ead6a8663cfdbacaa7b9a5e1ee035f483dc544853
SimHash 130549741217

Groups

*
chatgpt-user

Rule Path
Disallow /error-*
Disallow /images/
Disallow /d/Organization/Item*
Disallow /1003060/
Disallow /archives/
Disallow /click/
Disallow /taf.php
Disallow /mt-static/
Disallow /shared/
Disallow /tool*/
Disallow /cgi-bin
Disallow /event/
Disallow /utils/
Disallow /preview/
Disallow /api/latest-articles/*
Disallow /cdn-cgi/challenge-platform/*
Disallow /private/*

googlebot-news

Rule Path
Disallow /events/
Disallow /webinars/
Disallow /research/
Disallow /featured/
Disallow /d/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
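
The groups above can be checked with a short Python sketch using the standard library's urllib.robotparser. The ROBOTS_TXT string is a reconstruction from this report, so its layout, casing, and ordering are assumptions and not the live file. The helper name build_parser and the sample paths are hypothetical. Note that the standard-library parser does plain prefix matching and does not interpret `*` inside a path, so only the non-wildcard rules are exercised here.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report. The exact layout,
# casing, and ordering of the live file are assumptions.
ROBOTS_TXT = """\
User-agent: *
User-agent: chatgpt-user
Disallow: /error-*
Disallow: /images/
Disallow: /d/Organization/Item*
Disallow: /1003060/
Disallow: /archives/
Disallow: /click/
Disallow: /taf.php
Disallow: /mt-static/
Disallow: /shared/
Disallow: /tool*/
Disallow: /cgi-bin
Disallow: /event/
Disallow: /utils/
Disallow: /preview/
Disallow: /api/latest-articles/*
Disallow: /cdn-cgi/challenge-platform/*
Disallow: /private/*

User-agent: googlebot-news
Disallow: /events/
Disallow: /webinars/
Disallow: /research/
Disallow: /featured/
Disallow: /d/

User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /
"""

BASE = "https://www.cmswire.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical sample paths, chosen to hit one rule per group.
checks = [
    ("GPTBot", "/"),
    ("CCBot", "/any-page"),
    ("ChatGPT-User", "/images/logo.png"),
    ("Googlebot-News", "/events/some-event"),
    ("Googlebot-News", "/images/logo.png"),
    ("SomeBot", "/digital-experience/"),
]
for agent, path in checks:
    print(agent, path, parser.can_fetch(agent, BASE + path))
```

A crawler that matches a named group uses only that group's rules, which is why googlebot-news is not bound by the /images/ rule from the `*` group in this sketch.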

Other Records

Field Value
sitemap https://www.cmswire.com/sitemap.xml
sitemap https://www.cmswire.com/d/sitemap.xml
sitemap https://www.cmswire.com/sitemap-gnews.xml
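
As a minimal sketch, the sitemap records above can be read back with urllib.robotparser (site_maps() requires Python 3.8 or later). The SITEMAP_RECORDS string is rebuilt from this report, so the `Sitemap:` casing is an assumption about the live file.

```python
from urllib.robotparser import RobotFileParser

# Rebuilt from the "Other Records" table; field casing is an assumption.
SITEMAP_RECORDS = """\
Sitemap: https://www.cmswire.com/sitemap.xml
Sitemap: https://www.cmswire.com/d/sitemap.xml
Sitemap: https://www.cmswire.com/sitemap-gnews.xml
"""

parser = RobotFileParser()
parser.parse(SITEMAP_RECORDS.splitlines())

# site_maps() returns None when no sitemap records are present.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```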

Comments

  • robots.txt for http://www.cmswire.com/
  • Updated: 29-Feb-2024
  • END