somethingawful.com
robots.txt
Robots Exclusion Standard data for somethingawful.com
Resource Scan
Scan Details
Site Domain | somethingawful.com |
Base Domain | somethingawful.com |
Scan Status | Ok |
Last Scan | 2024-11-11T16:32:21+00:00 |
Next Scan | 2024-11-18T16:32:21+00:00 |
Last Scan
Scanned | 2024-11-11T16:32:21+00:00 |
URL | https://somethingawful.com/robots.txt |
Redirect | https://www.somethingawful.com/robots.txt |
Redirect Domain | www.somethingawful.com |
Redirect Base | somethingawful.com |
Domain IPs | 104.23.128.68, 104.23.129.68 |
Redirect IPs | 104.23.128.68, 104.23.129.68 |
Response IP | 104.23.128.68 |
Found | Yes |
Hash | f42b9e8bc01eedd50d6beb5fa6f8cc8819ffc2af5c0bafbc3e131b211f20334f |
SimHash | 691cd8728eb1 |
Groups
*
Rule | Path |
---|---|
Disallow | /alod/ |
Disallow | /search/ |
Disallow | /random/ |
Disallow | /hentai-game-reviews/ |
Disallow | /horrors-of-porn/ |
Disallow | /d/hentai-game-reviews/ |
Disallow | /d/horrors-of-porn/ |
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
Other Records
Field | Value |
---|---|
sitemap | http://www.somethingawful.com/sitemap/index.xml |
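
The records above can be checked against concrete URLs. The following is a minimal sketch using Python's standard `urllib.robotparser`. It assumes the scanned records correspond to a robots.txt laid out as in `ROBOTS_TXT` below; the raw file is not shown in this scan, so the directive order is a reconstruction, and the `ExampleBot` agent name, the `is_allowed` helper, and the `/index.php` test path are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the scanned records above.
# The raw robots.txt is not part of the scan, so the ordering is a guess.
ROBOTS_TXT = """\
User-agent: *
Disallow: /alod/
Disallow: /search/
Disallow: /random/
Disallow: /hentai-game-reviews/
Disallow: /horrors-of-porn/
Disallow: /d/hentai-game-reviews/
Disallow: /d/horrors-of-porn/
Crawl-delay: 1

Sitemap: http://www.somethingawful.com/sitemap/index.xml
"""

# Host the scan was redirected to.
BASE = "https://www.somethingawful.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the `*` group."""
    return parser.can_fetch(agent, BASE + path)


for path in ("/search/", "/d/horrors-of-porn/", "/index.php"):
    print(path, is_allowed(path))

# Delay (in seconds) and sitemap list as parsed from the records.
print(parser.crawl_delay("ExampleBot"))
print(parser.site_maps())  # site_maps() requires Python 3.8 or later
```

Because the only group is `*`, any agent name resolves to the same rules. Matching in `urllib.robotparser` is by path prefix, so `/search/anything` is blocked while `/search` without the trailing slash is not.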