actblogs.com
robots.txt
Robots Exclusion Standard data for actblogs.com
Resource Scan
Scan Details
Site Domain | actblogs.com |
Base Domain | actblogs.com |
Scan Status | Ok |
Last Scan | 2024-09-23T22:21:30+00:00 |
Next Scan | 2024-09-30T22:21:30+00:00 |
Last Scan
Scanned | 2024-09-23T22:21:30+00:00 |
URL | https://actblogs.com/robots.txt |
Domain IPs | 104.21.42.95, 172.67.204.187, 2606:4700:3031::ac43:ccbb, 2606:4700:3037::6815:2a5f |
Response IP | 104.21.42.95 |
Found | Yes |
Hash | 609a9f7301f62822159203c8460825c13f7bfae752e1df08e1427ebc0fb49f0c |
SimHash | 69110b0476c1 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /tmp/ |
Disallow | /junk/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.actblogs.com/sitemap_index.xml |