activerain.com
robots.txt

Robots Exclusion Standard data for activerain.com

Resource Scan

Scan Details

Site Domain activerain.com
Base Domain activerain.com
Scan Status Ok
Last Scan2024-10-20T23:29:16+00:00
Next Scan 2024-11-19T23:29:16+00:00

Last Scan

Scanned2024-10-20T23:29:16+00:00
URL https://activerain.com/robots.txt
Domain IPs 54.69.228.114
Response IP 54.69.228.114
Found Yes
Hash b24ff155c3ec3567afad08be763a79361d17364ccba41f303f11167d5eff9ccc
SimHash 027709fd6d75

Groups

shopwiki
twiceler
voyager
echrigy
voilabot
baiduspider
yisouspider
petalbot

Product Comment
twiceler http://www.cuill.com/twiceler/robot.html
voyager http://www.kosmix.com/html/crawler.html
echrigy http://www.techrigy.com/
voilabot http://www.voila.fr/
Rule Path
Disallow /

*

Rule Path
Disallow /private_login
Disallow /beta
Disallow /blogs/rss
Disallow /blogs/shorten_url
Disallow /action/blogs/shorten_url
Disallow /blogs/atom
Disallow /blog_entry_comments?sort=
Disallow /likes_list?return_url=
Disallow /action/spellcheck
Disallow /action/blogs_admin
Disallow /action/blogs_admin/subscribe
Disallow /blogindex
Disallow /action/signup/create
Disallow /action/nav/upload_photo
Disallow /action/product_review/thumbs_up
Disallow /action/product_review/thumbs_down
Disallow /realestateandhomes-search
Disallow /action/agents/show_score
Disallow /action/blogs/comments
Disallow /reset_password
Disallow /topbloggers
Disallow /arcaptcha
Disallow /assets/arcaptcha
Disallow /username_autocomplete
Disallow /articles
Disallow /Custom
Disallow /homedetails
Disallow /models
Disallow /photos
Disallow /town-services
Disallow /wiki

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://activerain.com/sitemap_index.xml

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /
  • User-agent: Googlebot
  • User-agent: ChatGPT

Warnings

  • 1 invalid line.