guance136.com
robots.txt
Robots Exclusion Standard data for guance136.com
Resource Scan
Scan Details
| Site Domain | guance136.com |
| Base Domain | guance136.com |
| Scan Status | Ok |
| Last Scan | 2026-02-05T19:26:19+00:00 |
| Next Scan | 2026-03-07T19:26:19+00:00 |
Last Scan
| Scanned | 2026-02-05T19:26:19+00:00 |
| URL | http://guance136.com/robots.txt |
| Domain IPs | 183.136.138.176 |
| Response IP | 183.136.138.176 |
| Found | Yes |
| Hash | cc10b95403c9f263d6d011a87adac44529771304f338f32af4810fdabcf44db6 |
| SimHash | 21164e62c2a6 |
Groups
*
| Rule | Path |
|---|---|
| Allow | / |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 2 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://guance136.com/sitemap.xml |
| sitemap | https://guance136.com/news-sitemap.xml |
Warnings
- `cache-control` is not a known field.
- `host` is not a known field.
- `x-robots-tag` is not a known field.
Comments