whcsolar.com
robots.txt

Robots Exclusion Standard data for whcsolar.com

Resource Scan

Scan Details

Site Domain whcsolar.com
Base Domain whcsolar.com
Scan Status Ok
Last Scan 2025-11-02T02:56:57+00:00
Next Scan 2025-12-02T02:56:57+00:00

Last Scan

Scanned 2025-11-02T02:56:57+00:00
URL https://whcsolar.com/robots.txt
Domain IPs 172.66.40.211, 172.66.43.45, 2606:4700:3108::ac42:28d3, 2606:4700:3108::ac42:2b2d
Response IP 172.66.43.45
Found Yes
Hash 65a086de2f44a54a3154e5d149febf3a5db1acc9c97b8f49ae1b0051121f4a53
SimHash 6a08c84083a2

Groups

bingbot

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 120

Other Records

Field Value
sitemap https://www.whcsolar.com/sitemap_index.xml
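
As a rough illustration of how the two groups above might be evaluated, the following Python sketch transcribes the scanned rules into a small table and applies longest-prefix matching, where the most specific matching path wins and Allow wins ties. The names GROUPS, select_group, and is_allowed are hypothetical, the user-agent match is a simple substring check, and wildcard characters in paths are not handled, so treat it as a simplification rather than a complete parser.

```python
from typing import Dict, List, Tuple

# Rules transcribed from the scan above: (rule, path) pairs per user-agent token.
GROUPS: Dict[str, List[Tuple[str, str]]] = {
    "bingbot": [("disallow", "/")],
    "*": [("disallow", "/wp-admin/"), ("allow", "/wp-admin/admin-ajax.php")],
}


def select_group(user_agent: str) -> List[Tuple[str, str]]:
    """Return the rules of the first named group whose token occurs in the
    user agent (case-insensitive); fall back to the '*' group."""
    ua = user_agent.lower()
    for token, rules in GROUPS.items():
        if token != "*" and token in ua:
            return rules
    return GROUPS["*"]


def is_allowed(user_agent: str, path: str) -> bool:
    """Longest matching path prefix wins; Allow wins ties; no match means allowed."""
    best_len, allowed = -1, True
    for rule, prefix in select_group(user_agent):
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and rule == "allow"
        if longer or tie_for_allow:
            best_len, allowed = len(prefix), rule == "allow"
    return allowed
```

Under these assumptions, bingbot is blocked from every path, while any other crawler is blocked only from /wp-admin/ and may still fetch /wp-admin/admin-ajax.php.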

Comments

  • Limit the crawl rate to one request every 120 seconds
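
To make the effect of the 120-second crawl-delay concrete, the sketch below computes the resulting daily request ceiling and the earliest times at which successive fetches could start. The crawl-delay field is not part of the standardized Robots Exclusion Protocol (RFC 9309), and crawlers differ in whether they honor it, so this only shows what a crawler that does respect it might compute. The function names are hypothetical.

```python
from typing import List

CRAWL_DELAY_SECONDS = 120  # value taken from the crawl-delay record above
SECONDS_PER_DAY = 86_400


def max_requests_per_day(delay_seconds: int = CRAWL_DELAY_SECONDS) -> int:
    """Upper bound on fetches per day if one request is sent per delay interval."""
    return SECONDS_PER_DAY // delay_seconds


def earliest_fetch_times(start: float, count: int,
                         delay_seconds: int = CRAWL_DELAY_SECONDS) -> List[float]:
    """Earliest start time (in seconds) for each of `count` consecutive fetches."""
    return [start + i * delay_seconds for i in range(count)]
```

With a 120-second delay, max_requests_per_day() gives 720, and earliest_fetch_times(0, 3) gives offsets of 0, 120, and 240 seconds.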