troprockin.com
robots.txt

Robots Exclusion Standard data for troprockin.com

Resource Scan

Scan Details

Site Domain troprockin.com
Base Domain troprockin.com
Scan Status Ok
Last Scan2025-10-18T19:29:54+00:00
Next Scan 2025-10-25T19:29:54+00:00

Last Scan

Scanned2025-10-18T19:29:54+00:00
URL https://troprockin.com/robots.txt
Domain IPs 104.21.68.230, 172.67.199.130, 2606:4700:3032::6815:44e6, 2606:4700:3036::ac43:c782
Response IP 104.21.68.230
Found Yes
Hash 7d5419e2b78af21b1e85830209a952de8c2a599409d298899a54e6afe380bab4
SimHash 50028972e9b2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /calendar/action~posterboard/
Disallow /calendar/action~agenda/
Disallow /calendar/action~oneday/
Disallow /calendar/action~month/
Disallow /calendar/action~week/
Disallow /calendar/action~stream/
Disallow /calendar/action~undefined/
Disallow /calendar/action~http%3A/
Disallow /calendar/action~default/
Disallow /calendar/action~poster/
Disallow /calendar/action~*/
Disallow /*controller%3Dai1ec_exporter_controller*
Disallow /*/action~*/

piplbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
Disallow /

omgili

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

Warnings

  • `user agent` is not a known field.