cnfirms.com
robots.txt

Robots Exclusion Standard data for cnfirms.com

Resource Scan

Scan Details

Site Domain cnfirms.com
Base Domain cnfirms.com
Scan Status Ok
Last Scan2024-06-12T17:37:10+00:00
Next Scan 2024-06-19T17:37:10+00:00

Last Scan

Scanned2024-06-12T17:37:10+00:00
URL https://cnfirms.com/robots.txt
Domain IPs 104.21.14.50, 172.67.202.56, 2606:4700:3030::6815:e32, 2606:4700:3032::ac43:ca38
Response IP 104.21.14.50
Found Yes
Hash 3e05c6ad2090a8dc38fdc7d85bc71f6b3e2bde158a1dbd1f7fa31cc6b701438b
SimHash 62384450d547

Groups

*

Rule Path
Disallow /robots-go-away/
Disallow /robots-go-away/*
Disallow /hit.png
Disallow /hit.png?*

Other Records

Field Value
sitemap https://cnfirms.com/sitemap.xml
sitemap https://cnfirms.com/mobile-sitemap.xml

Warnings

  • `noarchive` is not a known field.