curiousblogger.com
robots.txt

Robots Exclusion Standard data for curiousblogger.com

Resource Scan

Scan Details

Site Domain curiousblogger.com
Base Domain curiousblogger.com
Scan Status Ok
Last Scan2025-10-22T01:09:15+00:00
Next Scan 2025-10-29T01:09:15+00:00

Last Scan

Scanned2025-10-22T01:09:15+00:00
URL https://curiousblogger.com/robots.txt
Domain IPs 104.21.17.247, 172.67.178.229, 2606:4700:3030::6815:11f7, 2606:4700:3036::ac43:b2e5
Response IP 172.67.178.229
Found Yes
Hash d15f6f07f3ce5d5e338f43c2430d232d4aa23a539a0cd3318da3dae9341487b2
SimHash 6571599a66ca

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /page/
Disallow /blog/page/*
Disallow /dgd_scrollbox/
Disallow /?s=*
Disallow /go/
Disallow /recommended/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /search?
Disallow /?p=*
Disallow *?replytocom
Disallow */trackback
Disallow */comments
Disallow /tag/
Disallow /draft-posts/
Disallow /recommends/
Disallow /recommend/
Disallow /go/
Disallow /suggest/
Disallow /suggests/
Disallow /2017/
Disallow /2018/
Disallow /2019/
Disallow /2020/
Disallow /thank-you/

Other Records

Field Value
sitemap https://curiousblogger.com/sitemap_index.xml

Warnings

  • 1 invalid line.