100heuristics.tumblr.com
robots.txt
Robots Exclusion Standard data for 100heuristics.tumblr.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | 100heuristics.tumblr.com |
Base Domain | tumblr.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-11-02T01:30:57+00:00 |
Next Scan | 2025-01-01T01:30:57+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-08-26T21:00:44+00:00 |
URL | https://100heuristics.tumblr.com/robots.txt |
Domain IPs | 74.114.154.18, 74.114.154.22 |
Response IP | 74.114.154.18 |
Found | Yes |
Hash | 73d70bc9cad1221fcf91c51611ff557006342d639a1edc45eebbcc8f9751b7f8 |
SimHash | 6b9cd8438406 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /random |
Disallow | /day |
Disallow | /sticky-ad-iframe.html |
Disallow | /privacy/consent |
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
sitemap | https://100heuristics.tumblr.com/sitemap.xml |
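The group and records above can be checked programmatically. The following is a minimal sketch, assuming the file looked roughly like the reconstruction in `ROBOTS_TXT`; the report does not preserve the original line order, spacing, or comments. The agent name `ExampleBot` and the helper `is_allowed` are hypothetical, introduced only for illustration. It uses Python's standard `urllib.robotparser` module (`site_maps()` requires Python 3.8 or later).

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables in this report.
# The exact layout of the original file is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /random
Disallow: /day
Disallow: /sticky-ad-iframe.html
Disallow: /privacy/consent
Crawl-delay: 1

Sitemap: https://100heuristics.tumblr.com/sitemap.xml
"""

BASE = "https://100heuristics.tumblr.com"

# Parse the text directly instead of fetching it; the most recent scan of the
# live URL failed with a client error, so no network access is attempted here.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` under the parsed rules.

    `ExampleBot` is a hypothetical user agent; it falls under the `*` group.
    """
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for path in ("/", "/post/1", "/random", "/day/2024/08/26", "/privacy/consent"):
        print(path, "allowed" if is_allowed(path) else "disallowed")
    print("crawl-delay:", parser.crawl_delay("ExampleBot"))
    print("sitemaps:", parser.site_maps())
```

Note that `urllib.robotparser` treats each `Disallow` value as a path prefix, so `/day` also covers anything beneath it, such as `/day/2024/08/26`.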