teacherscollegesj.org
robots.txt

Robots Exclusion Standard data for teacherscollegesj.org

Resource Scan

Scan Details

Site Domain teacherscollegesj.org
Base Domain teacherscollegesj.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-23T22:44:27+00:00
Next Scan 2024-10-07T22:44:27+00:00

Last Successful Scan

Scanned2024-09-08T22:43:39+00:00
URL https://teacherscollegesj.org/robots.txt
Domain IPs 104.21.37.20, 172.67.203.68, 2606:4700:3030::6815:2514, 2606:4700:3036::ac43:cb44
Response IP 172.67.203.68
Found Yes
Hash 61afafa2ab43a1d82740777e8a03acf63c0b60542bd7fe008881e7d859a9953a
SimHash 0081df764a99

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow */feed
Disallow */rss
Disallow /contacts
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf

Other Records

Field Value
sitemap /wp-sitemap.xml