flyerbox.ca
robots.txt
Robots Exclusion Standard data for flyerbox.ca
Resource Scan
Scan Details
Site Domain | flyerbox.ca |
Base Domain | flyerbox.ca |
Scan Status | Ok |
Last Scan | 2024-09-29T08:46:04+00:00 |
Next Scan | 2024-10-06T08:46:04+00:00 |
Last Scan
Scanned | 2024-09-29T08:46:04+00:00 |
URL | https://flyerbox.ca/robots.txt |
Redirect | https://www.flyerbox.ca/robots.txt |
Redirect Domain | www.flyerbox.ca |
Redirect Base | flyerbox.ca |
Domain IPs | 147.182.158.16, 2604:a880:cad:d0::cfa:c007 |
Redirect IPs | 147.182.158.16, 2604:a880:cad:d0::cfa:c007 |
Response IP | 147.182.158.16 |
Found | Yes |
Hash | e93057b0d3b8dacbeab6e320c7b96b37aae62d2dec77524e6ae87dd7639c6172 |
SimHash | f869084ce992 |
Groups
*
Rule | Path |
---|---|
Disallow | /api/ |
Disallow | /site/ |
Disallow | /exit/ |
Disallow | /brochure/brochure-page/ |
Disallow | */?login-token= |
Disallow | */user-admin/* |
Disallow | */nove-heslo/ |
Disallow | */?page= |
Disallow | */offers/* |
Disallow | */detail/* |
Disallow | /27957108/* |
Disallow | /js/joined/bub.min.js |
Other Records
Field | Value |
---|---|
sitemap | https://www.flyerbox.ca/sitemap_index.xml |