newsroot.co.kr
robots.txt

Robots Exclusion Standard data for newsroot.co.kr

Resource Scan

Scan Details

Site Domain newsroot.co.kr
Base Domain newsroot.co.kr
Scan Status Ok
Last Scan2024-06-24T08:37:45+00:00
Next Scan 2024-07-01T08:37:45+00:00

Last Scan

Scanned2024-06-24T08:37:45+00:00
URL https://newsroot.co.kr/robots.txt
Domain IPs 183.111.174.75
Response IP 183.111.174.75
Found Yes
Hash 850ef26f7b56def2459145682fc842d6171bea0d67d7528b00eed5134366b6e3
SimHash 52549060af18

Groups

*

Rule Path
Allow /ads.txt

ahrefsbot
amazonbot
arachni
baiduspider
baiduspider
baiduspider+
bbot
blexbot
brands-bot
dataforseo-bot
dotbot
exabot
eyeotabot
megaindex
mj12bot
petalbot
semrushbot
wordpress

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /do-not-crawl/

*

Rule Path
Disallow /not-allowed/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow