cnfirms.com
robots.txt
Robots Exclusion Standard data for cnfirms.com
Resource Scan
Scan Details
Site Domain | cnfirms.com |
Base Domain | cnfirms.com |
Scan Status | Ok |
Last Scan | 2024-06-12T17:37:10+00:00 |
Next Scan | 2024-06-19T17:37:10+00:00 |
Last Scan
Scanned | 2024-06-12T17:37:10+00:00 |
URL | https://cnfirms.com/robots.txt |
Domain IPs | 104.21.14.50, 172.67.202.56, 2606:4700:3030::6815:e32, 2606:4700:3032::ac43:ca38 |
Response IP | 104.21.14.50 |
Found | Yes |
Hash | 3e05c6ad2090a8dc38fdc7d85bc71f6b3e2bde158a1dbd1f7fa31cc6b701438b |
SimHash | 62384450d547 |
Groups
*
Rule | Path |
---|---|
Disallow | /robots-go-away/ |
Disallow | /robots-go-away/* |
Disallow | /hit.png |
Disallow | /hit.png?* |
Other Records
Field | Value |
---|---|
sitemap | https://cnfirms.com/sitemap.xml |
sitemap | https://cnfirms.com/mobile-sitemap.xml |
Warnings
- `noarchive` is not a known field.