finetwice.com
robots.txt
Robots Exclusion Standard data for finetwice.com
Resource Scan
Scan Details
Site Domain | finetwice.com |
Base Domain | finetwice.com |
Scan Status | Ok |
Last Scan | 2024-10-01T17:22:04+00:00 |
Next Scan | 2024-10-08T17:22:04+00:00 |
Last Scan
Scanned | 2024-10-01T17:22:04+00:00 |
URL | https://www.finetwice.com/robots.txt |
Domain IPs | 2404:6800:4003:c1c::79, 74.125.200.121 |
Response IP | 64.233.170.121 |
Found | Yes |
Hash | 77d7832a9a407d12c29838b8606f5bb6395535438a5ebebd713bd0679f25c597 |
SimHash | 4b065a446690 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /p/blog-page_7.html |
Disallow | /p/blog-page_69.html |
Disallow | /p/blog-page_49.html |
Disallow | /p/blog-page.html |
Disallow | /error_page.html |
Disallow | /search/label/ |
Disallow | /search?updated-max= |
Disallow | /search?updated-min= |
Allow | / |
Allow | /ads.txt |
Other Records
Field | Value |
---|---|
sitemap | https://www.finetwice.com/sitemap.xml |
Warnings
- 1 invalid line.