an-library.com
robots.txt
Robots Exclusion Standard data for an-library.com
Resource Scan
Scan Details
Site Domain | an-library.com |
Base Domain | an-library.com |
Scan Status | Ok |
Last Scan | 2025-09-29T04:11:32+00:00 |
Next Scan | 2025-10-06T04:11:32+00:00 |
Last Scan
Scanned | 2025-09-29T04:11:32+00:00 |
URL | https://an-library.com/robots.txt |
Domain IPs | 104.21.71.43, 172.67.143.25, 2606:4700:3035::6815:472b, 2606:4700:3035::ac43:8f19 |
Response IP | 104.21.71.43 |
Found | Yes |
Hash | 1a29aac3a87d1efb292685fd76f4bc24cd1f5c599077b0a028bc25abf1db8450 |
SimHash | a31f1d5907e5 |
Groups
*
Rule | Path |
---|---|
Disallow | /administrator/ |
Disallow | /bin/ |
Disallow | /cache/ |
Disallow | /cli/ |
Disallow | /components/ |
Disallow | /includes/ |
Disallow | /installation/ |
Disallow | /language/ |
Disallow | /layouts/ |
Disallow | /libraries/ |
Disallow | /logs/ |
Disallow | /modules/ |
Disallow | /plugins/ |
Disallow | /tmp/ |
Other Records
Field | Value |
---|---|
sitemap | http://cdn.attracta.com/sitemap/4696483.xml.gz |
Comments