10000en.jp
robots.txt

Robots Exclusion Standard data for 10000en.jp

Resource Scan

Scan Details

Site Domain 10000en.jp
Base Domain 10000en.jp
Scan Status Ok
Last Scan2024-11-07T06:02:55+00:00
Next Scan 2024-12-07T06:02:55+00:00

Last Scan

Scanned2024-11-07T06:02:55+00:00
URL https://10000en.jp/robots.txt
Domain IPs 18.176.127.238, 35.79.188.125
Response IP 18.176.127.238
Found Yes
Hash 1db2909f760ed5dbb4945ca8bd3ddb5e78217db0c69f66e3a60b8695724048da
SimHash 5014cc40a689

Groups

*

Rule Path
Disallow /wp/wp-admin/

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

cincraw

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

facebook

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

linespider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://10000en.jp/sitemap_index.xml

Comments

  • 2021-10-06 Disallow
  • 2021-10-07
  • 2021-10-06 30sec
  • 2021-10-06 60sec
  • 2021-10-06 60sec
  • Crawl-delay: 30
  • 2021-10-06 10sec
  • User-Agent: Googlebot
  • Crawl-delay: 10