uaptc.edu
robots.txt

Robots Exclusion Standard data for uaptc.edu

Resource Scan

Scan Details

Site Domain uaptc.edu
Base Domain uaptc.edu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-03-13T13:30:08+00:00
Next Scan 2024-06-11T13:30:08+00:00

Last Successful Scan

Scanned2023-10-23T10:09:45+00:00
URL https://uaptc.edu/robots.txt
Domain IPs 104.26.8.242, 104.26.9.242, 172.67.69.46, 2606:4700:20::681a:8f2, 2606:4700:20::681a:9f2, 2606:4700:20::ac43:452e
Response IP 172.67.69.46
Found Yes
Hash 7bc45a1b81f1f1418741732d84b32a5c1374c73d390c3e3c61745a33081f3e3a
SimHash 11516849cf31

Groups

*

Rule Path Comment
Disallow /docs/default-source/* Block the doc directory.
Disallow *.pdf Block pdf files. Non-standard but works for major search engines
Disallow /images/default-source/* Block images directory