punchfork.com
robots.txt
Robots Exclusion Standard data for punchfork.com
Resource Scan
Scan Details
Site Domain | punchfork.com |
Base Domain | punchfork.com |
Scan Status | Ok |
Last Scan | 2024-10-02T09:45:17+00:00 |
Next Scan | 2024-11-01T09:45:17+00:00 |
Last Scan
Scanned | 2024-10-02T09:45:17+00:00 |
URL | https://punchfork.com/robots.txt |
Redirect | https://www.punchfork.com/robots.txt |
Redirect Domain | www.punchfork.com |
Redirect Base | punchfork.com |
Domain IPs | 3.16.87.35 |
Redirect IPs | 3.16.87.35 |
Response IP | 3.16.87.35 |
Found | Yes |
Hash | 74d720850de44714134063d9450a767cf75bd1cb00a3e22bd668cca9197efb8d |
SimHash | 431f9cd2ea9c |
Groups
*
Rule | Path |
---|---|
Disallow | /r/ |
Disallow | /unsubscribe |
Disallow | /ocu |
Disallow | /wiring/ |
Disallow | /*/totaltime/ |
Disallow | /*/new/diet/ |
Disallow | /*/top/diet/ |
Disallow | /*/trending/diet/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.punchfork.com/sitemap.xml |