collegepublishing.sagepub.com
robots.txt

Robots Exclusion Standard data for collegepublishing.sagepub.com

Resource Scan

Scan Details

Site Domain collegepublishing.sagepub.com
Base Domain sagepub.com
Scan Status Ok
Last Scan2025-06-01T23:37:20+00:00
Next Scan 2025-07-01T23:37:20+00:00

Last Scan

Scanned2025-06-01T23:37:20+00:00
URL https://collegepublishing.sagepub.com/robots.txt
Domain IPs 13.91.122.207
Response IP 13.91.122.207
Found Yes
Hash ff7b13cbd2ab9c745e3768dd58c6c6e7a0b3e694a7ccbbf22aad38e23d7af9e0
SimHash 601c1944a015

Groups

*

Rule Path
Disallow /sitefinity/
Disallow /Sitefinity/

yandex
ccbot
chatgpt-user
gptbot
google-extended
anthropic-ai
claudebot
omgilibot
omgili
facebookbot
diffbot
bytespider
imagesiftbot
cohere-ai

Rule Path
Disallow /

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

Other Records

Field Value
sitemap https://collegepublishing.sagepub.com/sitemap/sitemap.gz