Robots Exclusion Standard data for pariscom2030.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | pariscom2030.com |
Base Domain | pariscom2030.com |
Scan Status | Ok |
Last Scan | 2024-11-10T13:19:43+00:00 |
Next Scan | 2024-11-24T13:19:43+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-10T13:19:43+00:00 |
URL | https://pariscom2030.com/robots.txt |
Domain IPs | 104.21.88.151, 172.67.223.176, 2606:4700:3036::ac43:dfb0, 2606:4700:3037::6815:5897 |
Response IP | 104.21.88.151 |
Found | Yes |
Hash | 00553f57225f8c3a37405111456038e3937175940f780a5d6f4b2e62233ee18e |
SimHash | 63476f7529a7 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /cart |
Disallow | /order-completed/*/invoice |
Disallow | /orders/*/invoice |
Disallow | /products/*/reviews |
Disallow | /products/*/reviews/add |
Disallow | /o/*/invoice |
Disallow | /products?search=* |
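
To see how these rules apply to a concrete URL path, here is a minimal sketch of a matcher. It assumes the common robots.txt semantics described in RFC 9309: rules are prefix matches, `*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the scan; a real crawler may interpret the file differently.

```python
import re

# Rules transcribed from the "*" group in the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/cart"),
    ("disallow", "/order-completed/*/invoice"),
    ("disallow", "/orders/*/invoice"),
    ("disallow", "/products/*/reviews"),
    ("disallow", "/products/*/reviews/add"),
    ("disallow", "/o/*/invoice"),
    ("disallow", "/products?search=*"),
]


def pattern_to_regex(pattern):
    """Turn a robots.txt path pattern into a compiled regex (assumed semantics)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for verdict, pattern in rules:
        # re.match anchors at the start, which gives prefix matching.
        if pattern_to_regex(pattern).match(path):
            allow = verdict == "allow"
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and allow):
                best_len = length
                best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/cart", "/products/42", "/products/42/reviews",
              "/orders/123/invoice", "/products?search=shoes"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Because matching is by prefix, `/cart` also blocks paths such as `/cart/items`, while `Allow: /` only decides paths that no longer Disallow pattern matches.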
Warnings
- 20 invalid lines in robots.txt.