arlingtoncatholiccharities.com
robots.txt
Robots Exclusion Standard data for arlingtoncatholiccharities.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | arlingtoncatholiccharities.com |
| Base Domain | arlingtoncatholiccharities.com |
| Scan Status | Ok |
| Last Scan | 2026-02-17T01:41:37+00:00 |
| Next Scan | 2026-03-19T01:41:37+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-17T01:41:37+00:00 |
| URL | https://arlingtoncatholiccharities.com/robots.txt |
| Redirect | https://www.arlingtoncatholiccharities.com/robots.txt |
| Redirect Domain | www.arlingtoncatholiccharities.com |
| Redirect Base | arlingtoncatholiccharities.com |
| Domain IPs | 104.21.48.28, 172.67.176.36, 2606:4700:3030::ac43:b024, 2606:4700:3033::6815:301c |
| Redirect IPs | 104.21.48.28, 172.67.176.36, 2606:4700:3030::ac43:b024, 2606:4700:3033::6815:301c |
| Response IP | 172.67.176.36 |
| Found | Yes |
| Hash | 8b1f7ff490749820d235290b48bf5b8399f238bdd8c8fdeecbd4e33eac4c402b |
| SimHash | 68dc518880c0 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Allow | / |
| Allow | /wp-content/*.css |
| Allow | /wp-includes/*.js |
| Disallow | /cgi-bin |
| Disallow | /wp-includes/ |
| Disallow | /tag/*/page/ |
| Disallow | /page/ |
| Disallow | /?attachment_id* |
| Disallow | /*trackback |
| Disallow | /*trackback* |
| Disallow | /comments/feed/ |
| Disallow | /*/feed/$ |
| Disallow | /*/trackback/$ |
| Disallow | /*/*/feed/$ |
| Disallow | /*/*/feed/rss/$ |
| Disallow | /*/*/trackback/$ |
| Disallow | /*/*/*/feed/$ |
| Disallow | /*/*/*/feed/rss/$ |
| Disallow | /*/*/*/trackback/$ |
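The rules above overlap (for example, `/wp-includes/` is disallowed while `/wp-includes/*.js` is allowed), so the outcome for a given path depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309, where the most specific matching pattern wins and an allow rule wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers introduced for illustration; individual crawlers may evaluate these rules differently.

```python
import re

# Rules copied from the "*" group in the table above, in listed order.
RULES = [
    ("allow", "/"),
    ("allow", "/wp-content/*.css"),
    ("allow", "/wp-includes/*.js"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/tag/*/page/"),
    ("disallow", "/page/"),
    ("disallow", "/?attachment_id*"),
    ("disallow", "/*trackback"),
    ("disallow", "/*trackback*"),
    ("disallow", "/comments/feed/"),
    ("disallow", "/*/feed/$"),
    ("disallow", "/*/trackback/$"),
    ("disallow", "/*/*/feed/$"),
    ("disallow", "/*/*/feed/rss/$"),
    ("disallow", "/*/*/trackback/$"),
    ("disallow", "/*/*/*/feed/$"),
    ("disallow", "/*/*/*/feed/rss/$"),
    ("disallow", "/*/*/*/trackback/$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins, and an allow rule
    wins when an allow and a disallow pattern have equal length.
    A path that matches no rule is treated as allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/wp-includes/js/jquery.js")` is true because the 17-character allow pattern `/wp-includes/*.js` outranks the 13-character disallow pattern `/wp-includes/`, while `is_allowed("/wp-includes/version.php")` is false because only the disallow pattern and the catch-all `/` match.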
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.arlingtoncatholiccharities.com/sitemapindex.xml |