themuck.org
robots.txt
Robots Exclusion Standard data for themuck.org
Resource Scan
Scan Details
Site Domain | themuck.org |
Base Domain | themuck.org |
Scan Status | Ok |
Last Scan | 2025-08-14T17:25:30+00:00 |
Next Scan | 2025-09-13T17:25:30+00:00 |
Last Scan
Scanned | 2025-08-14T17:25:30+00:00 |
URL | https://themuck.org/robots.txt |
Domain IPs | 198.185.159.144, 198.185.159.145, 198.49.23.144, 198.49.23.145 |
Response IP | 198.49.23.145 |
Found | Yes |
Hash | 93d7fd755d02ba018960a6fc577e4317762b25b0e5cd9d9b20f607776c0155e8 |
SimHash | 15901d4ac880 |
Groups
amazonbot
anthropic-ai
applebot-extended
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
duckassistbot
facebookbot
google-cloudvertexbot
google-extended
gptbot
meta-externalagent
meta-externalagent
perplexitybot
quora-bot
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
*
Rule | Path |
---|---|
Disallow | /config |
Disallow | /search |
Disallow | /account$ |
Disallow | /account/ |
Disallow | /commerce/digital-download/ |
Disallow | /api/ |
Allow | /api/ui-extensions/ |
Disallow | /static/ |
Disallow | /*?author=* |
Disallow | /*%26author%3D* |
Disallow | /*?tag=* |
Disallow | /*%26tag%3D* |
Disallow | /*?month=* |
Disallow | /*%26month%3D* |
Disallow | /*?view=* |
Disallow | /*%26view%3D* |
Disallow | /*?format=json |
Disallow | /*%26format%3Djson |
Disallow | /*?format=page-context |
Disallow | /*%26format%3Dpage-context |
Disallow | /*?format=main-content |
Disallow | /*%26format%3Dmain-content |
Disallow | /*?format=json-pretty |
Disallow | /*%26format%3Djson-pretty |
Disallow | /*?format=ical |
Disallow | /*%26format%3Dical |
Disallow | /*?reversePaginate=* |
Disallow | /*%26reversePaginate%3D* |
Other Records
Field | Value |
---|---|
sitemap | https://themuck.org/sitemap.xml |
Comments