panfleteiro.pt
robots.txt
Robots Exclusion Standard data for panfleteiro.pt
Resource Scan
Scan Details
Site Domain | panfleteiro.pt |
Base Domain | panfleteiro.pt |
Scan Status | Ok |
Last Scan | 2024-09-27T17:08:38+00:00 |
Next Scan | 2024-10-04T17:08:38+00:00 |
Last Scan
Scanned | 2024-09-27T17:08:38+00:00 |
URL | https://panfleteiro.pt/robots.txt |
Redirect | https://www.panfleteiro.pt/robots.txt |
Redirect Domain | www.panfleteiro.pt |
Redirect Base | panfleteiro.pt |
Domain IPs | 194.4.48.39, 2a03:b0c0:2:f0::40:3001 |
Redirect IPs | 194.4.48.39, 2a03:b0c0:2:f0::40:3001 |
Response IP | 194.4.48.39 |
Found | Yes |
Hash | 515e714ecf84bb54718b7ae9b7458d7ff35df4298b2fd13c9eb80fb6fe2d3c95 |
SimHash | da29085ce912 |
Groups
*
Rule | Path |
---|---|
Disallow | /api/ |
Disallow | /site/ |
Disallow | /exit/ |
Disallow | /brochure/brochure-page/ |
Disallow | */?login-token= |
Disallow | */user-admin/* |
Disallow | */nove-heslo/ |
Disallow | */?page= |
Disallow | */offers/* |
Disallow | */detail/* |
Disallow | /27957108/* |
Disallow | /js/joined/bub.min.js |
Other Records
Field | Value |
---|---|
sitemap | https://www.panfleteiro.pt/sitemap_index.xml |