twentydaily.com
robots.txt
Robots Exclusion Standard data for twentydaily.com
Resource Scan
Scan Details
Site Domain | twentydaily.com |
Base Domain | twentydaily.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-09-07T03:40:26+00:00 |
Next Scan | 2024-12-06T03:40:26+00:00 |
Last Successful Scan
Scanned | 2023-02-12T09:52:24+00:00 |
URL | https://twentydaily.com/robots.txt |
Domain IPs | 104.18.26.183, 104.18.27.183, 2606:4700::6812:1ab7, 2606:4700::6812:1bb7 |
Response IP | 104.18.26.183 |
Found | Yes |
Hash | a6a4c0e5c2a3df1a864b2fe1e3d6c5e2923b255ff5e50a0a7deea45c5fcb2468 |
SimHash | 63105805a180 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Disallow | /feed |
Disallow | /*.gif$ |
Disallow | /*.jpg$ |
Disallow | /*.jpeg$ |
Disallow | /*.png$ |
Disallow | /*.tif$ |
Disallow | /*.tiff$ |
Disallow | /*.webp$ |
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
sitemap | https://twentydaily.com/sitemap.xml.gz |