panfleteiro.pt
robots.txt
Robots Exclusion Standard data for panfleteiro.pt
Resource Scan
Scan Details
Site Domain | panfleteiro.pt |
Base Domain | panfleteiro.pt |
Scan Status | Ok |
Last Scan | 2024-11-01T18:24:54+00:00 |
Next Scan | 2024-11-08T18:24:54+00:00 |
Last Scan
Scanned | 2024-11-01T18:24:54+00:00 |
URL | https://panfleteiro.pt/robots.txt |
Redirect | https://www.panfleteiro.pt/robots.txt |
Redirect Domain | www.panfleteiro.pt |
Redirect Base | panfleteiro.pt |
Domain IPs | 194.4.48.39, 2a11:4c00:0:10::1 |
Redirect IPs | 194.4.48.39, 2a11:4c00:0:10::1 |
Response IP | 194.4.48.39 |
Found | Yes |
Hash | ec80a99827e959e9f202bc0eea027fb311feae7ebe56594a5ff4ca9db7e156f7 |
SimHash | d8294854ed52 |
Groups
*
Rule | Path |
---|---|
Disallow | /api/ |
Disallow | /site/ |
Disallow | /exit/ |
Disallow | /brochure/brochure-page/ |
Disallow | */?login-token= |
Disallow | */user-admin/* |
Disallow | */nove-heslo/ |
Disallow | */?page= |
Disallow | */offers/* |
Disallow | */detail/* |
Disallow | */detalhe/* |
Disallow | /27957108/* |
Disallow | /js/joined/bub.min.js |
Other Records
Field | Value |
---|---|
sitemap | https://www.panfleteiro.pt/sitemap_index.xml |