jd.com
robots.txt

Robots Exclusion Standard data for jd.com

Resource Scan

Scan Details

Site Domain jd.com
Base Domain jd.com
Scan Status Ok
Last Scan 2025-09-02T22:49:38+00:00
Next Scan 2025-10-02T22:49:38+00:00

Last Scan

Scanned 2025-09-02T22:49:38+00:00
URL https://jd.com/robots.txt
Domain IPs 106.39.171.134, 111.13.149.108, 211.144.24.218, 211.144.27.126
Response IP 211.144.24.218
Found Yes
Hash 3e033ad639e1be74865f620abd440088a90fd889f9c55584ecaf4a166987be1b
SimHash 6475c1366293

Groups

*

Rule Path
Disallow /*?

googlebot

Rule Path
Allow /*?

googlebot-image

Rule Path
Allow /*?

googlebot-mobile

Rule Path
Allow /*?

googlebot-video

Rule Path
Allow /*?

googlebot-news

Rule Path
Allow /*?

bingbot

Rule Path
Allow /*?

msnbot

Rule Path
Allow /*?

bingbot-mobile

Rule Path
Allow /*?

baiduspider

Rule Path
Allow /*?

baiduspider-image

Rule Path
Allow /*?

baiduspider-video

Rule Path
Allow /*?

yandexbot

Rule Path
Allow /*?

yandexbot-image

Rule Path
Allow /*?

yandexbot-mobile

Rule Path
Allow /*?

sogou web spider

Rule Path
Allow /*?

sogou pic spider

Rule Path
Allow /*?

haosouspider

Rule Path
Allow /*?

sosoimagespider

Rule Path
Allow /*?

sosospider

Rule Path
Allow /*?
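
The groups above can be read as: any URL path containing a query string (`?`) is disallowed for unlisted crawlers and allowed for each named crawler. The following is a minimal Python sketch of that evaluation, not the scanner's or any crawler's actual implementation. It assumes the usual wildcard convention (`*` matches any run of characters, a trailing `$` anchors the end) and longest-match precedence with ties going to Allow. The `GROUPS` table reproduces only two of the listed groups, user-agent matching is simplified to an exact lowercase lookup, and `otherbot` is a hypothetical crawler name used for illustration.

```python
import re

# Two of the groups from the report; the remaining named crawlers
# carry the same single "Allow /*?" rule as googlebot.
GROUPS = {
    "*": [("disallow", "/*?")],
    "googlebot": [("allow", "/*?")],
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent: str, url_path: str) -> bool:
    """Return True if url_path may be fetched by user_agent.

    Simplification (assumption): the group is chosen by exact lowercase
    name, falling back to the '*' group when the name is not listed.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow) of the winning rule so far
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(url_path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow (True) wins.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

With these rules, `is_allowed("googlebot", "/search?q=x")` is true, `is_allowed("otherbot", "/search?q=x")` is false, and `is_allowed("otherbot", "/index.html")` is true because the path has no `?` and so matches no rule.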

Warnings

  • 2 invalid lines.