arlingtoncatholiccharities.com
robots.txt

Robots Exclusion Standard data for arlingtoncatholiccharities.com

Resource Scan

Scan Details

Site Domain arlingtoncatholiccharities.com
Base Domain arlingtoncatholiccharities.com
Scan Status Ok
Last Scan 2026-02-17T01:41:37+00:00
Next Scan 2026-03-19T01:41:37+00:00

Last Scan

Scanned 2026-02-17T01:41:37+00:00
URL https://arlingtoncatholiccharities.com/robots.txt
Redirect https://www.arlingtoncatholiccharities.com/robots.txt
Redirect Domain www.arlingtoncatholiccharities.com
Redirect Base arlingtoncatholiccharities.com
Domain IPs 104.21.48.28, 172.67.176.36, 2606:4700:3030::ac43:b024, 2606:4700:3033::6815:301c
Redirect IPs 104.21.48.28, 172.67.176.36, 2606:4700:3030::ac43:b024, 2606:4700:3033::6815:301c
Response IP 172.67.176.36
Found Yes
Hash 8b1f7ff490749820d235290b48bf5b8399f238bdd8c8fdeecbd4e33eac4c402b
SimHash 68dc518880c0
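
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below only assumes SHA-256 over the raw response body; `body_digest` is a hypothetical helper, and the SimHash field is not reproduced because its construction is not described.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes of the body. The report
    does not confirm the algorithm or any normalization step.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Placeholder body, not the real file contents.
    sample = b"User-agent: *\nDisallow: /cgi-bin\n"
    print(body_digest(sample))
```

Comparing a digest computed this way between two scans would show whether the file changed, provided the assumption about the algorithm holds.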

Groups

*

Rule Path
Allow /
Allow /wp-content/*.css
Allow /wp-includes/*.js
Disallow /cgi-bin
Disallow /wp-includes/
Disallow /tag/*/page/
Disallow /page/
Disallow /?attachment_id*
Disallow /*trackback
Disallow /*trackback*
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
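
The rules in the `*` group mix Allow and Disallow paths with `*` and `$` wildcards, so the outcome for a given URL depends on which rule wins. The sketch below is one way to evaluate them, assuming RFC 9309 semantics: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `STAR_RULES`, `pattern_to_regex`, and `is_allowed` are illustrative, and only a subset of the rules above is copied in. Individual crawlers may deviate from this behavior.

```python
import re

# A subset of the "*" group rules listed above.
STAR_RULES = [
    ("allow", "/"),
    ("allow", "/wp-content/*.css"),
    ("allow", "/wp-includes/*.js"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/page/"),
    ("disallow", "/?attachment_id*"),
    ("disallow", "/*/feed/$"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape literal pieces and join them with ".*" in place of each "*".
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, url_path):
    """Return True if url_path may be fetched under the given rules."""
    best = None  # (pattern length, is_allow); tuple order lets Allow win ties
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            key = (len(path), kind == "allow")
            if best is None or key > best:
                best = key
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/about/", "/wp-includes/js/jquery.js",
              "/wp-includes/version.php", "/page/2/", "/blog/feed/"]:
        print(p, is_allowed(STAR_RULES, p))
```

Under these assumptions, `/wp-includes/js/jquery.js` is allowed because `Allow /wp-includes/*.js` is longer than `Disallow /wp-includes/`, while `/wp-includes/version.php` is blocked. Python's built-in `urllib.robotparser` is not used here because it does not interpret the `*` and `$` wildcards.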

slurpbot

Rule Path
Disallow /Sitemap

baiduspider

Rule Path
Disallow /Sitemap

googlebot

Rule Path
Disallow /Sitemap

bingbot

Rule Path
Disallow /Sitemap

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

teleport

Rule Path
Disallow /

linko

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /
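
The groups above are keyed by user-agent name, so a crawler first has to decide which single group applies to it. The sketch below illustrates that selection, assuming RFC 9309 behavior: the crawler's product token is compared case-insensitively against the group names, and the `*` group is used only when no named group matches. `GROUPS` is an abbreviated copy of the listing above and `select_group` is a hypothetical helper.

```python
# Abbreviated copy of the groups listed above (rule lists shortened).
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/wp-includes/")],
    "googlebot": [("disallow", "/Sitemap")],
    "bingbot": [("disallow", "/Sitemap")],
    "wget": [("disallow", "/")],
    "httrack": [("disallow", "/")],
}


def select_group(product_token, groups=GROUPS):
    """Return the name of the group that applies to a crawler.

    product_token is the crawler's short name (for example "Googlebot"),
    not the full User-Agent header. Matching is case-insensitive, and the
    "*" group is the fallback when no named group matches.
    """
    key = product_token.lower()
    return key if key in groups else "*"


if __name__ == "__main__":
    for token in ["Googlebot", "Wget", "DuckDuckBot"]:
        name = select_group(token)
        print(token, "->", name, GROUPS[name])
```

Read this way, a crawler with its own group follows only that group's rules. For example, googlebot would be subject to `Disallow /Sitemap` alone, and the longer `*` rule list would not be merged in.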

Other Records

Field Value
sitemap https://www.arlingtoncatholiccharities.com/sitemapindex.xml