ieslasmusas.org
robots.txt

Robots Exclusion Standard data for ieslasmusas.org

Resource Scan

Scan Details

Site Domain ieslasmusas.org
Base Domain ieslasmusas.org
Scan Status Ok
Last Scan2026-01-09T14:04:49+00:00
Next Scan 2026-02-08T14:04:49+00:00

Last Scan

Scanned2026-01-09T14:04:49+00:00
URL https://ieslasmusas.org/robots.txt
Domain IPs 82.194.68.78
Response IP 82.194.68.78
Found Yes
Hash 9f379093df2c56b9dc2f8ba7b3ced62416b3101d3ee4f1928a17456e4800870f
SimHash 6845d8900e16

Groups

*

Rule Path
Disallow /wp-content/*
Disallow /wp-content/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-includes/
Disallow /wp-admin/
Disallow /biblio/
Disallow /clasesenlinea/
Disallow /biblioteca/
Disallow /wp-
Disallow /?s=
Disallow /search
Allow /feed/$
Disallow /feed
Disallow /comments/feed
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 50

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10