mla.org
robots.txt

Robots Exclusion Standard data for mla.org

Resource Scan

Scan Details

Site Domain mla.org
Base Domain mla.org
Scan Status Ok
Last Scan2024-05-09T05:57:37+00:00
Next Scan 2024-06-08T05:57:37+00:00

Last Scan

Scanned2024-05-09T05:57:37+00:00
URL https://mla.org/robots.txt
Redirect https://www.mla.org/robots.txt
Redirect Domain www.mla.org
Redirect Base mla.org
Domain IPs 52.71.164.101
Redirect IPs 13.226.120.112, 13.226.120.121, 13.226.120.24, 13.226.120.31, 2600:9000:200a:2e00:19:2a30:1480:93a1, 2600:9000:200a:7600:19:2a30:1480:93a1, 2600:9000:200a:7a00:19:2a30:1480:93a1, 2600:9000:200a:8e00:19:2a30:1480:93a1, 2600:9000:200a:9400:19:2a30:1480:93a1, 2600:9000:200a:a400:19:2a30:1480:93a1, 2600:9000:200a:a800:19:2a30:1480:93a1, 2600:9000:200a:d200:19:2a30:1480:93a1
Response IP 13.227.254.10
Found Yes
Hash ec1ab0151477b95524e04a5b83b471418e5c2d681eb633df681119b167f25d26
SimHash 681151c1c0f1

Groups

*

Rule Path
Disallow /

googlebot
bingbot
bingpreview
msnbot
slurp
duckduckbot
applebot
ia_archiver
facebookexternalhit
twitterbot
linkedinbot

Rule Path
Allow /
Disallow /admin/
Disallow /site_admin/
Disallow /search
Disallow /search/
Disallow /content/search
Disallow /content/advancedsearch
Disallow /content/tipafriend
Disallow /layout/set/print
Disallow /rss
Disallow /media/
Disallow /ezinfo/
Disallow /user/
Disallow /test-area/
Disallow /bulletin/search/
Disallow /shop/search/

Comments

  • Disallow all
  • But allow only important bots