learnist.org
robots.txt

Robots Exclusion Standard data for learnist.org

Resource Scan

Scan Details

Site Domain learnist.org
Base Domain learnist.org
Scan Status Ok
Last Scan2024-09-26T05:23:01+00:00
Next Scan 2024-10-03T05:23:01+00:00

Last Scan

Scanned2024-09-26T05:23:01+00:00
URL https://learnist.org/robots.txt
Redirect https://www.learnist.org/robots.txt
Redirect Domain www.learnist.org
Redirect Base learnist.org
Domain IPs 104.21.27.57, 172.67.141.120, 2606:4700:3033::ac43:8d78, 2606:4700:3035::6815:1b39
Redirect IPs 104.21.27.57, 172.67.141.120, 2606:4700:3033::ac43:8d78, 2606:4700:3035::6815:1b39
Response IP 172.67.141.120
Found Yes
Hash 340e606f4a666cc6bf410470d62e2c6f176278723a2531e0a0aa805468b4e8c0
SimHash 4000da0243b2

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php