edumail.icu
robots.txt

Robots Exclusion Standard data for edumail.icu

Resource Scan

Scan Details

Site Domain edumail.icu
Base Domain edumail.icu
Scan Status Ok
Last Scan2025-03-11T22:43:12+00:00
Next Scan 2025-03-18T22:43:12+00:00

Last Scan

Scanned2025-03-11T22:43:12+00:00
URL https://edumail.icu/robots.txt
Domain IPs 104.21.46.123, 172.67.138.142, 2606:4700:3030::6815:2e7b, 2606:4700:3030::ac43:8a8e
Response IP 104.21.46.123
Found Yes
Hash e014d5f9a7f48ddeb3ded063d7f88c2b67eb006f96bcdf3b4977a6c8cfda9714
SimHash 2054dc924503

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

robozilla

Rule Path
Disallow

slurp

Rule Path
Disallow

gigabot

Rule Path
Disallow

msnbot

Rule Path
Disallow

teoma

Rule Path
Disallow

nutch

Rule Path
Disallow

baiduspider

Rule Path
Disallow

naverbot

Rule Path
Disallow

yeti

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

psbot

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow

ia_archiver/v3.9

Rule Path
Disallow

*

Rule Path
Disallow

Other Records

Field Value
sitemap https://edumail.icu/sitemap.xml