kukai.icu
robots.txt

Robots Exclusion Standard data for kukai.icu

Resource Scan

Scan Details

Site Domain kukai.icu
Base Domain kukai.icu
Scan Status Ok
Last Scan2025-12-14T23:55:31+00:00
Next Scan 2025-12-21T23:55:31+00:00

Last Scan

Scanned2025-12-14T23:55:31+00:00
URL https://kukai.icu/robots.txt
Domain IPs 43.169.12.122
Response IP 43.169.12.122
Found Yes
Hash 78f608d1ed74f5a2c5a42169baf9c96b3fa59a734fe62c072216043b7027b953
SimHash 015fd0010546

Groups

baiduspider

Rule Path
Disallow

sosospider

Rule Path
Disallow

sogou spider

Rule Path
Disallow

yodaobot

Rule Path
Disallow

googlebot

Rule Path
Disallow

bingbot

Rule Path
Disallow

slurp

Rule Path
Disallow

teoma

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

msnbot

Rule Path
Disallow

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow /

psbot

Rule Path
Disallow /

*

Rule Path
Disallow
Disallow /wp-admin
Disallow /wp-content/plugins
Disallow /wp-content/themes
Disallow /wp-includes
Disallow /trackback
Disallow /feed
Disallow /comments

Other Records

Field Value
sitemap http://www.kukai.icu/sitemap.xml
sitemap http://www.kukai.icu/sitemap-posttype-sites.xml

Comments

  • robots.txt generated at http://www.kukai.icu/sitemap.xml