
Robots Exclusion Standard data for camdencc.edu

Resource Scan

Scan Details

Site Domain camdencc.edu
Base Domain camdencc.edu
Scan Status Ok
Last Scan 2025-12-09T13:53:32+00:00
Next Scan 2026-01-08T13:53:32+00:00

Last Scan

Scanned 2025-12-09T13:53:32+00:00
URL https://camdencc.edu/robots.txt
Domain IPs 170.249.207.74
Response IP 170.249.207.74
Found Yes
Hash 02c9abd107cd33352bcf5b1c1f85d35be593313e7e382a9795f268e56cb5c2e1
SimHash 711d8957a106

Groups

*

Rule Path
Disallow /wp-admin/*
Allow /wp-admin/admin-ajax.php
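
The Disallow/Allow pair above relies on rule precedence: under the Robots Exclusion Protocol (RFC 9309), the matching rule with the longest path wins, and an Allow wins a tie. The following is a minimal sketch of that evaluation, not the scanner's or any crawler's actual implementation. The helper names `pattern_to_regex` and `is_allowed` and the `STAR_GROUP` list are hypothetical, introduced only for illustration.

```python
import re

def pattern_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = path_pattern.endswith("$")
    core = path_pattern[:-1] if anchored else path_pattern
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path: str) -> bool:
    """Return True if `path` may be fetched under `rules`.

    `rules` is a list of (kind, pattern) tuples, kind being "allow" or
    "disallow". The longest matching pattern wins; on equal length the
    allow rule wins. With no matching rule the path is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow

# Rules copied from the "*" group listed above (hypothetical variable name).
STAR_GROUP = [
    ("disallow", "/wp-admin/*"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

for candidate in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/about/"):
    print(candidate, is_allowed(STAR_GROUP, candidate))
```

Under this sketch, `/wp-admin/admin-ajax.php` stays fetchable because the Allow path is longer than the Disallow path, while other paths under `/wp-admin/` are blocked.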

googlebot

Rule Path
Disallow /wp-admin/*
Allow /wp-admin/admin-ajax.php

googlebot-image

Rule Path
Disallow /wp-admin/*
Allow /wp-admin/admin-ajax.php

googlebot

Rule Path
Disallow /oh19
Disallow /signature.html

*

Rule Path
Disallow /oh19
Disallow /signature.html
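
The `*` and `googlebot` groups each appear twice in this scan. RFC 9309 states that when several groups name the same user agent, a crawler combines their rules into one group. A minimal sketch of that merge and of group selection follows. It assumes exact, case-insensitive matching on the product token with a fallback to `*`; the names `merge_groups`, `select_rules`, and `SCANNED_GROUPS` are hypothetical, and only a subset of the scanned groups is reproduced.

```python
from collections import defaultdict

def merge_groups(groups):
    """Combine rules of groups that share a user agent.

    `groups` is a list of (user_agent, rules) pairs in file order.
    Duplicate rules are dropped; rule order is otherwise preserved.
    """
    merged = defaultdict(list)
    for agent, rules in groups:
        bucket = merged[agent.lower()]
        for rule in rules:
            if rule not in bucket:
                bucket.append(rule)
    return dict(merged)

def select_rules(merged, product_token: str):
    """Pick the rules for a crawler: its own group if present, else '*'."""
    return merged.get(product_token.lower(), merged.get("*", []))

# Subset of the groups listed in this scan (hypothetical variable name).
WP_ADMIN = [("disallow", "/wp-admin/*"), ("allow", "/wp-admin/admin-ajax.php")]
LEGACY = [("disallow", "/oh19"), ("disallow", "/signature.html")]
SCANNED_GROUPS = [
    ("*", WP_ADMIN),
    ("googlebot", WP_ADMIN),
    ("googlebot", LEGACY),
    ("*", LEGACY),
    ("gptbot", [("disallow", "/")]),
]

merged = merge_groups(SCANNED_GROUPS)
print(select_rules(merged, "Googlebot"))  # four rules from both googlebot groups
print(select_rules(merged, "GPTBot"))     # the single site-wide disallow
print(select_rules(merged, "bingbot"))    # falls back to the merged "*" group
```

Treating the repeated groups as one means a crawler matching `googlebot` or `*` is bound by all four paths, not only by the first group it encounters.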

seekportbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

meta-externalads

Rule Path
Disallow /events/*
Disallow /wp-content/*
Disallow /wp-includes/*
Disallow /wp-admin/*

meta-externalads/1.1

Rule Path
Disallow /events/*
Disallow /wp-content/*
Disallow /wp-includes/*
Disallow /wp-admin/*