howtohow.org
robots.txt
Robots Exclusion Standard data for howtohow.org
Resource Scan
Scan Details
Site Domain | howtohow.org |
Base Domain | howtohow.org |
Scan Status | Ok |
Last Scan | 2024-10-30T09:08:30+00:00 |
Next Scan | 2024-11-06T09:08:30+00:00 |
Last Scan
Scanned | 2024-10-30T09:08:30+00:00 |
URL | https://howtohow.org/robots.txt |
Domain IPs | 104.21.53.163, 172.67.215.42, 2606:4700:3031::ac43:d72a, 2606:4700:3037::6815:35a3 |
Response IP | 104.21.53.163 |
Found | Yes |
Hash | 008b3897c7e50ca8436da455c9216c1a7366ff1df9255f3d15776be48325cd23 |
SimHash | c84cdcc0a01b |
Groups
*
Rule | Path |
---|---|
Disallow | /?s= |
Disallow | /page/*/?s= |
Disallow | /search/ |
Disallow | /wp-json/ |
Disallow | /?rest_route= |
Other Records
Field | Value |
---|---|
sitemap | https://howtohow.org/sitemap_index.xml |
Comments