theins.press
robots.txt
Robots Exclusion Standard data for theins.press
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | theins.press |
| Base Domain | theins.press |
| Scan Status | Ok |
| Last Scan | 2025-11-02T03:21:32+00:00 |
| Next Scan | 2025-11-09T03:21:32+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-02T03:21:32+00:00 |
| URL | https://theins.press/robots.txt |
| Domain IPs | 104.21.84.230, 172.67.198.94, 2606:4700:3033::ac43:c65e, 2606:4700:3034::6815:54e6 |
| Response IP | 172.67.198.94 |
| Found | Yes |
| Hash | f3cff7631a531eed286724520f4e1968582b10b881c49e92e6d9865734100943 |
| SimHash | 6d951c80efd2 |
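The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. As a minimal sketch under the assumption that it is SHA-256 over the raw response body, a freshly fetched copy of the file could be compared against the recorded value like this. The function name `matches_recorded_hash` is hypothetical and not part of the report.

```python
import hashlib

# Value copied from the "Hash" row of the Last Scan table.
RECORDED_HASH = "f3cff7631a531eed286724520f4e1968582b10b881c49e92e6d9865734100943"


def matches_recorded_hash(body: bytes, recorded_hex: str = RECORDED_HASH) -> bool:
    """Return True if the SHA-256 digest of `body` equals `recorded_hex`.

    Assumption: the scanner hashes the raw, unmodified response body with
    SHA-256. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest() == recorded_hex.lower()
```

If the comparison fails for a file fetched today, that may mean only that the file has changed since the scan, or that the assumption about the algorithm or input bytes is wrong.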
Groups
*
| Rule | Path |
|---|---|
| Disallow | /search |
| Disallow | /tags/ |
| Disallow | /tag/ |
| Disallow | /admin/ |
| Disallow | /wp- |
| Disallow | /amp/news/%7B%7Bterm_* |
| Disallow | /secret |
Other Records
| Field | Value |
|---|---|
| sitemap | https://theins.ru/sitemap.xml |
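To illustrate how the records above apply to a crawler, the sketch below rebuilds a robots.txt from the Groups and Other Records tables and checks a few paths with Python's standard `urllib.robotparser`. The reconstructed file text is an assumption: the original file's exact formatting, ordering, and comments are not shown in the report. The user agent `ExampleBot`, the helper `is_allowed`, and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the tables above; the real file's layout may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search
Disallow: /tags/
Disallow: /tag/
Disallow: /admin/
Disallow: /wp-
Disallow: /amp/news/%7B%7Bterm_*
Disallow: /secret
Sitemap: https://theins.ru/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Check a path on theins.press against the reconstructed rules."""
    return parser.can_fetch(agent, "https://theins.press" + path)


for sample in ["/search", "/tag/politics", "/wp-login.php", "/news/example"]:
    print(sample, "allowed" if is_allowed(sample) else "disallowed")

# Sitemap records are exposed separately from the allow/disallow rules.
print(parser.site_maps())
```

The standard-library parser matches rules by path prefix, so its handling of the trailing `*` in the `/amp/news/` rule may differ from crawlers that implement wildcard matching. That rule is left out of the sample checks for this reason.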