iep-edu.pe
robots.txt

Robots Exclusion Standard data for iep-edu.pe

Resource Scan

Scan Details

Site Domain iep-edu.pe
Base Domain iep-edu.pe
Scan Status Ok
Last Scan2024-05-11T02:12:37+00:00
Next Scan 2024-06-10T02:12:37+00:00

Last Scan

Scanned2024-05-11T02:12:37+00:00
URL https://iep-edu.pe/robots.txt
Redirect https://www.iep-edu.pe/robots.txt
Redirect Domain www.iep-edu.pe
Redirect Base iep-edu.pe
Domain IPs 104.26.0.249, 104.26.1.249, 172.67.72.203, 2606:4700:20::681a:1f9, 2606:4700:20::681a:f9, 2606:4700:20::ac43:48cb
Redirect IPs 104.26.0.249, 104.26.1.249, 172.67.72.203, 2606:4700:20::681a:1f9, 2606:4700:20::681a:f9, 2606:4700:20::ac43:48cb
Response IP 104.26.1.249
Found Yes
Hash 3e74716d99fef09e88338102fab34172089a34b4b13678c3932ee5c78e09bb3a
SimHash 4e545c0ac620

Groups

*

Rule Path
Disallow /feed/
Disallow /trackback/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /xmlrpc.php
Disallow /wp-

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

teoma

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

psbot

Rule Path
Disallow /

asterias

Rule Path
Disallow /

yandex

Rule Path
Disallow /