Robots Exclusion Standard data for teacherscollegesj.org
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | teacherscollegesj.org |
Base Domain | teacherscollegesj.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-23T22:44:27+00:00 |
Next Scan | 2024-10-07T22:44:27+00:00 |
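
The gap between the two timestamps above can be checked directly. This is a small sketch using only the values from the table; the variable names are illustrative, and the source does not state that the scan schedule is fixed at this interval.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset included).
last_scan = datetime.fromisoformat("2024-09-23T22:44:27+00:00")
next_scan = datetime.fromisoformat("2024-10-07T22:44:27+00:00")

# Difference between the scheduled next scan and the last (failed) scan.
interval = next_scan - last_scan
print(interval)  # 14 days, 0:00:00
```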
Last Successful Scan
Field | Value |
---|---|
Scanned | 2024-09-08T22:43:39+00:00 |
URL | https://teacherscollegesj.org/robots.txt |
Domain IPs | 104.21.37.20, 172.67.203.68, 2606:4700:3030::6815:2514, 2606:4700:3036::ac43:cb44 |
Response IP | 172.67.203.68 |
Found | Yes |
Hash | 61afafa2ab43a1d82740777e8a03acf63c0b60542bd7fe008881e7d859a9953a |
SimHash | 0081df764a99 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin |
Disallow | /?s= |
Disallow | *%26s%3D |
Disallow | /search |
Disallow | /author/ |
Disallow | */feed |
Disallow | */rss |
Disallow | /contacts |
Allow | /wp-content/uploads/ |
Allow | /wp-content/themes/ |
Allow | /*/*.js |
Allow | /*/*.css |
Allow | /wp-*.png |
Allow | /wp-*.jpg |
Allow | /wp-*.jpeg |
Allow | /wp-*.gif |
Allow | /wp-*.svg |
Allow | /wp-*.pdf |
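
To make the precedence between these rules concrete, here is a minimal Python sketch of the matching procedure described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the sketch skips percent-encoding normalization, so treat it as an illustration rather than a description of how any particular crawler evaluates this file.

```python
import re

# Rules copied from the "*" group above, in their listed order.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/?s="),
    ("disallow", "*%26s%3D"),
    ("disallow", "/search"),
    ("disallow", "/author/"),
    ("disallow", "*/feed"),
    ("disallow", "*/rss"),
    ("disallow", "/contacts"),
    ("allow", "/wp-content/uploads/"),
    ("allow", "/wp-content/themes/"),
    ("allow", "/*/*.js"),
    ("allow", "/*/*.css"),
    ("allow", "/wp-*.png"),
    ("allow", "/wp-*.jpg"),
    ("allow", "/wp-*.jpeg"),
    ("allow", "/wp-*.gif"),
    ("allow", "/wp-*.svg"),
    ("allow", "/wp-*.pdf"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern decides; on equal length Allow wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as prefix matching requires.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow and not best_allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/blog/feed")` is `False` because only `*/feed` matches, while `is_allowed("/wp-content/uploads/feed.pdf")` is `True` because the longer `/wp-content/uploads/` pattern outranks `*/feed`.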
Other Records
Field | Value |
---|---|
sitemap | /wp-sitemap.xml |
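
The sitemap value recorded here is a path rather than a full URL, whereas the sitemaps protocol describes the `Sitemap` field as a complete URL. A consumer that wants to fetch it could resolve the path against the robots.txt URL listed under Last Successful Scan, as in the sketch below. Whether a given crawler performs this resolution is an assumption, not something the scan data states; the variable names are illustrative.

```python
from urllib.parse import urljoin

# URL of the robots.txt file, from the Last Successful Scan table.
robots_url = "https://teacherscollegesj.org/robots.txt"

# Sitemap value exactly as recorded under Other Records.
sitemap_value = "/wp-sitemap.xml"

# Standard relative-reference resolution: a leading "/" replaces the whole path.
sitemap_url = urljoin(robots_url, sitemap_value)
print(sitemap_url)  # https://teacherscollegesj.org/wp-sitemap.xml
```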