1ebook.org
robots.txt

Robots Exclusion Standard data for 1ebook.org

Resource Scan

Scan Details

Site Domain 1ebook.org
Base Domain 1ebook.org
Scan Status Ok
Last Scan2024-06-08T16:29:19+00:00
Next Scan 2024-07-08T16:29:19+00:00

Last Scan

Scanned2024-06-08T16:29:19+00:00
URL http://1ebook.org/robots.txt
Domain IPs 38.12.202.246
Response IP 38.12.202.246
Found Yes
Hash 1e943408c93355e3f15834457a391d79dd84a937731dd93250fc5587d9982843
SimHash 685555834787

Groups

baiduspider

Rule Path
Disallow

baiduspider-image

Rule Path
Disallow

baiduspider-render

Rule Path
Disallow

sosospider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow

sogou inst spider

Rule Path
Disallow

sogou spider2

Rule Path
Disallow

sogou news spider

Rule Path
Disallow

sogou orion spider

Rule Path
Disallow

jikespider

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

teoma

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

googlebot-mobile

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow /

psbot

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap /sitemap.xml

Comments

  • ÷¼÷ÃÏÀ×îÇ¿robots Ö»ÔÊÐí°Ù¶È

Warnings

  • 2 invalid lines.