ko.hinative.com
robots.txt

Robots Exclusion Standard data for ko.hinative.com

Resource Scan

Scan Details

Site Domain ko.hinative.com
Base Domain hinative.com
Scan Status Ok
Last Scan 2024-09-21T04:48:23+00:00
Next Scan 2024-10-05T04:48:23+00:00

Last Scan

Scanned 2024-09-21T04:48:23+00:00
URL https://ko.hinative.com/robots.txt
Domain IPs 34.192.119.173, 34.237.17.208, 44.216.74.110, 54.156.73.10
Response IP 44.216.74.110
Found Yes
Hash d8f31f27c89e2109e019d7fbc1f6e45e18513554ea03d167ed046d0ceae5288c
SimHash cb8ece2567d1

Groups

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow /*/profiles/
Allow /profiles/
Disallow /*/activities
Disallow /*/answers/
Disallow /*/questions/*/answers/
Disallow /*/search/
Disallow /*/lives/
Disallow /activities
Disallow /answers/
Disallow /questions/*/answers/
Disallow /search/
Disallow /lives/
Allow /dictionaries/search/
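
The rules in the "*" group mix Allow and Disallow lines with "*" wildcards, so the outcome for a given path depends on which rule is treated as the most specific. The following is a minimal sketch of the longest-match evaluation described in RFC 9309, applied to the rules listed above. The names RULES, pattern_to_regex, and is_allowed are hypothetical helpers written for this illustration, and a real crawler may resolve edge cases differently.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/*/profiles/"),
    ("allow", "/profiles/"),
    ("disallow", "/*/activities"),
    ("disallow", "/*/answers/"),
    ("disallow", "/*/questions/*/answers/"),
    ("disallow", "/*/search/"),
    ("disallow", "/*/lives/"),
    ("disallow", "/activities"),
    ("disallow", "/answers/"),
    ("disallow", "/questions/*/answers/"),
    ("disallow", "/search/"),
    ("disallow", "/lives/"),
    ("allow", "/dictionaries/search/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern wins; on a tie, Allow beats Disallow.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under this reading, /dictionaries/search/hello is allowed because the Allow pattern is longer than the competing Disallow /*/search/, while /search/hello and /ko/profiles/123 are disallowed and /profiles/123 is allowed.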

googlebot-image

Rule Path
Disallow /*/questions/*/ogp_image

google-extended

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

seekportbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

proximic

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

ias_crawler

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5
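
A crawler applies only the group whose name matches its own product token and falls back to "*" when no named group matches. That is why the four crawl-delay groups above are reported as "All paths allowed": they do not inherit the Disallow rules of the "*" group. The sketch below illustrates that selection step using the group names from this scan. The function names and the data layout are assumptions made for this example. crawl-delay is not part of RFC 9309, so whether and how a crawler honors it varies.

```python
# Group names as reported in the scan above.
FULLY_BLOCKED = {"chatgpt-user", "ccbot", "google-extended", "amazonbot"}

# Groups that carry only a crawl-delay record (value as reported above).
CRAWL_DELAY_ONLY = {
    "bingbot": 5,
    "seekportbot": 5,
    "proximic": 5,
    "ias_crawler": 5,
}

NAMED_GROUPS = FULLY_BLOCKED | set(CRAWL_DELAY_ONLY) | {"googlebot-image"}


def select_group(product_token):
    """Return the group that applies to a crawler's product token.

    Matching is case-insensitive; crawlers without a named group use "*".
    """
    token = product_token.lower()
    return token if token in NAMED_GROUPS else "*"


def crawl_delay(product_token):
    """Return the crawl-delay value for the selected group, or None."""
    return CRAWL_DELAY_ONLY.get(select_group(product_token))
```

For example, select_group("Bingbot") yields "bingbot" with a crawl-delay of 5, while an unlisted token such as "DuckDuckBot" yields "*" and no crawl-delay.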

Other Records

Field Value
sitemap https://ko.hinative.com/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2013/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2014/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2015/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2016/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2017/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2018/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2019/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2020/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2021/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2022/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2023/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/2024/sitemap.xml.gz
sitemap https://ko.hinative.com/latests/sitemap.xml.gz
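
All of the sitemap records above point to gzip-compressed files. As a rough sketch of how such a file could be read once its bytes have been downloaded, the helper below decompresses the payload and collects every <loc> entry. It assumes the files follow the sitemaps.org 0.9 schema, which this scan does not confirm, and extract_locs is a hypothetical name chosen for this example.

```python
import gzip
import xml.etree.ElementTree as ET

# Namespace used by the sitemaps.org 0.9 schema (assumed, not verified here).
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def extract_locs(gz_bytes):
    """Decompress a .xml.gz sitemap and return the text of each <loc>.

    Works for both <urlset> and <sitemapindex> documents, since both
    wrap their URLs in <loc> elements under the same namespace.
    """
    root = ET.fromstring(gzip.decompress(gz_bytes))
    return [el.text.strip() for el in root.iter(SITEMAP_NS + "loc") if el.text]
```

The function takes raw bytes rather than a URL so that fetching, rate limiting, and error handling stay with the caller.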

Comments

  • See https://developers.google.com/search/docs/crawling-indexing/robots/intro for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /