the-reference.com
robots.txt
Robots Exclusion Standard data for the-reference.com
Resource Scan
Scan Details
| Site Domain | the-reference.com |
| Base Domain | the-reference.com |
| Scan Status | Ok |
| Last Scan | 2025-12-05T02:11:32+00:00 |
| Next Scan | 2026-01-04T02:11:32+00:00 |
Last Scan
| Scanned | 2025-12-05T02:11:32+00:00 |
| URL | https://the-reference.com/robots.txt |
| Redirect | https://www.the-reference.com/robots.txt |
| Redirect Domain | www.the-reference.com |
| Redirect Base | the-reference.com |
| Domain IPs | 13.69.68.31, 23.100.1.29 |
| Redirect IPs | 23.100.1.29 |
| Response IP | 23.100.1.29 |
| Found | Yes |
| Hash | 81fd42d7e77f5922d8de24ab653fe84c3ee1910e6425b674e835e84bbde0ca1a |
| SimHash | 71001b474f30 |
Groups
*
| Rule | Path |
|---|---|
| Allow | /DependencyHandler.axd |
| Disallow | /aspnet_client/ |
| Disallow | /bin/ |
| Disallow | /config/ |
| Disallow | /umbraco/ |
| Disallow | /umbraco_client/ |
| Disallow | /usercontrols/ |
| Disallow | /*.axd |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.the-reference.com/sitemap.xml |
Comments