ebooks.com
robots.txt

Robots Exclusion Standard data for ebooks.com

Resource Scan

Scan Details

Site Domain ebooks.com
Base Domain ebooks.com
Scan Status Ok
Last Scan 2024-10-20T07:51:48+00:00
Next Scan 2024-11-19T07:51:48+00:00
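
The two timestamps above imply the rescan interval. A minimal sketch of the arithmetic, assuming both values are ISO 8601 timestamps in UTC as shown (the variable names are illustrative and not part of the scan data):

```python
from datetime import datetime

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2024-10-20T07:51:48+00:00")
next_scan = datetime.fromisoformat("2024-11-19T07:51:48+00:00")

# Difference between the scheduled next scan and the last completed scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```

The result suggests a 30-day rescan cycle for this domain, though the report itself does not state the schedule as a rule.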

Last Scan

Scanned 2024-10-20T07:51:48+00:00
URL https://ebooks.com/robots.txt
Redirect https://www.ebooks.com/robots.txt
Redirect Domain www.ebooks.com
Redirect Base ebooks.com
Domain IPs 104.20.29.108, 104.20.30.108
Redirect IPs 104.20.29.108, 104.20.30.108
Response IP 104.20.29.108
Found Yes
Hash 4934d286babb58b391ba373a723a88b8ef6f3268c78bd722e5f22d688d05d063
SimHash 6c105386c2fa

Groups

facebookexternalhit

Rule Path
Allow /

adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
adidxbot
ahrefs
applebot
applenewsbot
baiduspider
baiduspider-image
baiduspider-news
baiduspider-video
bingbot
bingpreview
bublupbot
ccbot
cliqzbot
coccoc
coccocbot-image
coccocbot-web
daumoa
dazoobot
deusu
duckduckbot
duckduckgo-favicons-bot
euripbot
exploratodo
facebookcatalog
facebookexternalhit
facebot
feedly
findxbot
googlebot
googlebot-image
googlebot-mobile
googlebot-news
googlebot-video
haosouspider
ichiro
istellabot
jikespider
lycos
mail.ru
mediapartners-google
microsoftpreview
mojeekbot
msnbot
msnbot-media
orangebot
pinterest
plukkie
qwantify
rambler
seznambot
sosospider
slurp
sogou blog
sogou inst spider
sogou news spider
sogou orion spider
sogou spider2
sogou web spider
sputnikbot
teoma
twitterbot
whatsapp
wotbox
yacybot
yandex
yandexmobilebot
yeti
yioopbot
yoozbot
youdaobot

Rule Path
Disallow /account/*
Disallow /cart/*
Disallow /cj.asp
Disallow /en-*/cj.asp
Disallow /api/user/*
Disallow /api/book/*/review

Other Records

Field Value
crawl-delay 5
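
The Disallow paths in this group use the `*` wildcard, which many standard-library parsers do not interpret. The sketch below shows one way such patterns might be evaluated, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and a pattern otherwise matches as a prefix. The function names `pattern_to_regex` and `is_disallowed` are hypothetical helpers, not part of any scanner or crawler named here.

```python
import re

# Disallow patterns copied from the group above.
DISALLOW = [
    "/account/*",
    "/cart/*",
    "/cj.asp",
    "/en-*/cj.asp",
    "/api/user/*",
    "/api/book/*/review",
]

def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where the pattern had "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)

for path in ["/account/settings", "/en-us/cj.asp", "/api/book/123/review", "/api/book/123"]:
    print(path, is_disallowed(path))
```

Under these assumptions the first three sample paths are blocked and `/api/book/123` is not, since no listed pattern is a prefix of it. The crawl-delay value of 5 is not handled here; it is a non-standard field whose interpretation (usually seconds between requests) varies by crawler.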

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.ebooks.com/sitemap/sitemap-index.xml
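
The three groups above can be read as a lookup: a named crawler uses the group or groups that list it, and any other crawler falls back to `*`. The sketch below illustrates that selection, assuming case-insensitive exact matching on the agent name and assuming that rules from every matching group are merged, as RFC 9309 describes. The agent list in the second group is abbreviated for space, and `GROUPS` and `rules_for` are hypothetical names used only for illustration.

```python
# Groups as reported by the scan. The second group's agent list is
# abbreviated; the full list appears in the report above.
GROUPS = [
    {"agents": ["facebookexternalhit"], "rules": [("allow", "/")]},
    {
        "agents": ["googlebot", "bingbot", "facebookexternalhit"],  # abbreviated
        "rules": [
            ("disallow", "/account/*"),
            ("disallow", "/cart/*"),
            ("disallow", "/cj.asp"),
            ("disallow", "/en-*/cj.asp"),
            ("disallow", "/api/user/*"),
            ("disallow", "/api/book/*/review"),
        ],
    },
    {"agents": ["*"], "rules": [("disallow", "/")]},
]

def rules_for(agent: str) -> list:
    """Collect rules from every group naming the agent, else from the '*' group."""
    token = agent.lower()
    matched = [g for g in GROUPS if token in g["agents"]]
    if not matched:
        matched = [g for g in GROUPS if "*" in g["agents"]]
    return [rule for g in matched for rule in g["rules"]]

print(rules_for("SomeUnknownBot"))          # falls back to the '*' group
print(len(rules_for("facebookexternalhit")))  # rules merged from two groups
```

Read this way, unlisted crawlers are blocked from the whole site, while listed crawlers are blocked only from the account, cart, affiliate, and user API paths. The sitemap record is conventionally independent of any group, even though the report lists it under `*`.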

Warnings

  • 3 invalid lines.