allsides.com
robots.txt
Robots Exclusion Standard data for allsides.com
Resource Scan
Scan Details
Site Domain | allsides.com |
Base Domain | allsides.com |
Scan Status | Ok |
Last Scan | 2025-10-18T16:28:51+00:00 |
Next Scan | 2025-10-25T16:28:51+00:00 |
Last Scan
Scanned | 2025-10-18T16:28:51+00:00 |
URL | https://allsides.com/robots.txt |
Domain IPs | 104.20.26.22, 172.66.169.97, 2606:4700:10::6814:1a16, 2606:4700:10::ac42:a961 |
Response IP | 172.66.169.97 |
Found | Yes |
Hash | f859d69a07ce8c9037329cd11e9f10d13f36d9896b045f689cc64c109c54c9aa |
SimHash | 44350953c5d4 |
Groups
*
Rule | Path |
---|---|
Allow | / |
*
Rule | Path |
---|---|
Allow | / |
Disallow | /admin/ |
Disallow | /cat/ |
Disallow | /key/ |
Disallow | /*? |
Disallow | /*.inc$ |
Disallow | /cgi-bin |
Warnings
- `content-signal` is not a known field.
Comments