catholic.org
robots.txt

Robots Exclusion Standard data for catholic.org

Resource Scan

Scan Details

Site Domain catholic.org
Base Domain catholic.org
Scan Status Ok
Last Scan2024-11-16T08:57:09+00:00
Next Scan 2024-11-23T08:57:09+00:00

Last Scan

Scanned2024-11-16T08:57:09+00:00
URL https://catholic.org/robots.txt
Redirect https://www.catholic.org/robots.txt
Redirect Domain www.catholic.org
Redirect Base catholic.org
Domain IPs 69.16.233.11
Redirect IPs 69.16.233.11
Response IP 69.16.233.11
Found Yes
Hash 3717ce25b08af97636d8583adc3e10257ebbcd889dd643c0234e00da421ba9ba
SimHash 3b11c90075c3

Groups

*

Rule Path
Disallow /ach/
Disallow /adv/
Disallow /beta/
Disallow /blast/
Disallow /chaletmagnificat/
Disallow /chat/
Disallow /custshop/
Disallow /directory/
Disallow /eric/
Disallow /Excite/
Disallow /forward_email/
Disallow /gift_pope/
Disallow /html/
Disallow /iframe/
Disallow /images/
Disallow /includes/
Disallow /nav/
Disallow /phpframedirect/
Disallow /poll/
Disallow /pop/
Disallow /pope_slide/
Disallow /openx/
Disallow /dead/
Disallow /yui/
Disallow /recordReads.php
Disallow /cellphone/
Disallow /submit_comments.php
Disallow /prayers/candle/