liguemidgetaaa.ca
robots.txt
Robots Exclusion Standard data for liguemidgetaaa.ca
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | liguemidgetaaa.ca |
Base Domain | liguemidgetaaa.ca |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-25T18:10:56+00:00 |
Next Scan | 2024-12-24T18:10:56+00:00 |
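
The Last Scan and Next Scan timestamps above can be compared directly, since both are ISO 8601 values with an explicit UTC offset. The sketch below only does that arithmetic; the variable names are illustrative, and reading the gap as a fixed rescan interval after a failed scan is an assumption, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-09-25T18:10:56+00:00")
next_scan = datetime.fromisoformat("2024-12-24T18:10:56+00:00")

# Difference between the two timezone-aware datetimes.
interval = next_scan - last_scan
print(f"Days until next scan: {interval.days}")
```

For these two values the difference is exactly 90 days.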
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-02-29T18:03:57+00:00 |
URL | https://www.liguemidgetaaa.ca/robots.txt |
Redirect | https://www.m18aaa.com/robots.txt |
Redirect Domain | www.m18aaa.com |
Redirect Base | m18aaa.com |
Domain IPs | 104.21.16.161, 172.67.214.188, 2606:4700:3033::ac43:d6bc, 2606:4700:3037::6815:10a1 |
Redirect IPs | 104.21.1.152, 172.67.129.114, 2606:4700:3033::ac43:8172, 2606:4700:3037::6815:198 |
Response IP | 172.67.129.114 |
Found | Yes |
Hash | bf6e0be40e8f3be62012999107415c6da0d26a6b4a7dbf61c7df79a73bb9449b |
SimHash | 2c3eddf07609 |
Groups
*
Rule | Path |
---|---|
Disallow | /*%7B%7B |
Disallow | /*%7B%7B |
Disallow | /*?SID= |
Disallow | /*?no_cache= |
Disallow | /*?nocache= |
Disallow | /tmp/ |
Disallow | /vDev/ |
Disallow | /vPreprod/ |
Disallow | /webmailAPIs/ |
Disallow | /ctr/ |
Disallow | /sponsors/ |
Disallow | /adpics/ |
Disallow | /vProd/iframeSession.php |
Disallow | /v5/ |
Disallow | /v5dev/ |
Disallow | /chrysophylax/ |
Disallow | /ressources/files/ |
Disallow | /fr/ms/reseaupublicationsports/ |
Disallow | /en/ms/reseaupublicationsports/ |
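
Several of the rules above use the `*` wildcard, so a plain prefix comparison is not enough to decide whether a path is blocked. The following is a minimal sketch of wildcard matching in the style described by RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the path). The helper names `rule_to_regex` and `is_disallowed` are hypothetical, the rule list is copied from the table with the duplicate entry listed once, and the sketch assumes the request path is already percent-encoded the same way as the rules. Because this group contains no Allow rules, any matching Disallow rule blocks the path.

```python
import re

# Disallow paths for the "*" group, copied from the table above.
DISALLOW = [
    "/*%7B%7B", "/*?SID=", "/*?no_cache=", "/*?nocache=",
    "/tmp/", "/vDev/", "/vPreprod/", "/webmailAPIs/", "/ctr/",
    "/sponsors/", "/adpics/", "/vProd/iframeSession.php",
    "/v5/", "/v5dev/", "/chrysophylax/", "/ressources/files/",
    "/fr/ms/reseaupublicationsports/", "/en/ms/reseaupublicationsports/",
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal segments and let each "*" match any character run.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches from the start of path."""
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


for candidate in ["/tmp/cache.html", "/fr/nouvelles?SID=abc123", "/fr/equipes"]:
    print(candidate, is_disallowed(candidate))
```

With this rule set, the first two candidate paths are blocked and `/fr/equipes` is not. The candidate paths are made up for illustration and do not come from the scanned site.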
Warnings
- 2 invalid lines.