kyeongin.com
robots.txt

Robots Exclusion Standard data for kyeongin.com

Resource Scan

Scan Details

Site Domain kyeongin.com
Base Domain kyeongin.com
Scan Status Ok
Last Scan2024-05-29T16:40:06+00:00
Next Scan 2024-06-05T16:40:06+00:00

Last Scan

Scanned2024-05-29T16:40:06+00:00
URL https://kyeongin.com/robots.txt
Redirect http://kyeongin.com/robots.txt
Domain IPs 101.55.50.7
Response IP 101.55.50.7
Found Yes
Hash 2327aba2994b909a016b7b04cf5ea917a011e89799cd6431d91f33d00a7a8227
SimHash be4646b0fcf8

Groups

*

Rule Path
Disallow /kiib/

Other Records

Field Value Comment
crawl-delay 30 10 seconds between page requests

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

trendkite-akashic-crawler

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

Comments

  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot
  • Block trendkite-akashic-crawler