newduck.net
robots.txt

Robots Exclusion Standard data for newduck.net

Resource Scan

Scan Details

Site Domain newduck.net
Base Domain newduck.net
Scan Status Ok
Last Scan 2024-11-14T01:47:53+00:00
Next Scan 2024-11-21T01:47:53+00:00

Last Scan

Scanned 2024-11-14T01:47:53+00:00
URL https://newduck.net/robots.txt
Domain IPs 104.26.6.250, 104.26.7.250, 172.67.70.163, 2606:4700:20::681a:6fa, 2606:4700:20::681a:7fa, 2606:4700:20::ac43:46a3
Response IP 172.67.70.163
Found Yes
Hash 6e5f8a340683f8554ddd95faa6de30d118150cc463ee811f9617e6b6e1dd81e0
SimHash 2334055be952

Groups

ia_archiver

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

iccrawler

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

tweetmemebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

boardreader

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /
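Each named group above carries a single "Disallow /" rule, which blocks that crawler from the whole site. A minimal sketch of how such groups could be checked with Python's standard urllib.robotparser follows. The ROBOTS_TXT text is a hypothetical reconstruction of a few groups from this report, not the raw file, and may_fetch is a helper name introduced only for illustration. Wildcard rules are left out of the sample because urllib.robotparser only does plain prefix matching.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a few groups from the scan report.
# The real robots.txt may differ in casing, ordering, and whitespace.
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: *
Disallow: /_loader
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def may_fetch(user_agent: str, path: str) -> bool:
    """Return True if user_agent is allowed to fetch path on newduck.net."""
    return parser.can_fetch(user_agent, "https://newduck.net" + path)


if __name__ == "__main__":
    for agent in ("GPTBot/1.0", "CCBot/2.0", "Googlebot"):
        print(agent, may_fetch(agent, "/"), may_fetch(agent, "/_loader"))
```

Crawlers with their own group are refused everywhere, while any other agent falls through to the "*" group and is only refused on the paths listed there.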

*

Rule Path
Disallow /*listStyle%3D
Disallow /*act%3DIS%26
Disallow /*act%3DIS$
Disallow /*act%3DdispBoardCategory
Disallow /*act%3DprocFileDownload
Disallow /*search_keyword%3D
Disallow /*search_target%3D
Disallow /*module_srl%3D
Disallow /*act%3DdispMemberBookmark
Disallow /*_filter%3D
Disallow /*m%3D0%26
Disallow /*m%3D0$
Disallow /*m%3D1%26
Disallow /*m%3D1$
Disallow /*m%3D6%26
Disallow /*m%3D6$
Disallow /_loader
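The rules in the "*" group rely on two pattern extensions: "*" matches any run of characters and a trailing "$" anchors the rule to the end of the URL. The sketch below shows one way such patterns could be evaluated by translating them to regular expressions. It compares strings literally and assumes the path is given in the same percent-encoded form as the rules (for example "%3D" rather than "="); real crawlers may normalize encodings differently. The names rule_to_regex, is_disallowed, and SAMPLE_RULES are illustrative only.

```python
import re

# A few rules copied from the "*" group in this report.
SAMPLE_RULES = ["/*act%3DIS%26", "/*act%3DIS$", "/_loader"]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the match at the end of the
    path; everything else is matched literally from the start of the path.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str, rules=SAMPLE_RULES) -> bool:
    """Return True if any rule matches the path (literal comparison only)."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for path in ("/index.php?act%3DIS", "/index.php?act%3DISO",
                 "/_loader/x", "/board/123"):
        print(path, is_disallowed(path))
```

The paired rules such as "act%3DIS%26" and "act%3DIS$" block the parameter whether it is followed by another parameter or ends the URL, without also blocking longer values like "act%3DISO".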