insurancejournal.com
robots.txt

Robots Exclusion Standard data for insurancejournal.com

Resource Scan

Scan Details

Site Domain insurancejournal.com
Base Domain insurancejournal.com
Scan Status Ok
Last Scan2024-10-17T08:00:54+00:00
Next Scan 2024-11-16T08:00:54+00:00

Last Scan

Scanned2024-10-17T08:00:54+00:00
URL https://insurancejournal.com/robots.txt
Redirect https://www.insurancejournal.com/robots.txt
Redirect Domain www.insurancejournal.com
Redirect Base insurancejournal.com
Domain IPs 169.61.31.50
Redirect IPs 169.61.31.50
Response IP 169.61.31.50
Found Yes
Hash 88179b364ff413161e0e2a2903f55bd99366c9aa8fbf54b9156efda55356da35
SimHash 0930da25e0b1

Groups

googlebot-news

Rule Path
Disallow /*?comments$

trendkite-akashic-crawler

Rule Path
Disallow /

*

Rule Path
Disallow /ads/
Disallow /tmp/
Disallow /example/
Disallow /scholarship/
Disallow /subscribe/unsubscribe.php
Disallow /adshowcase/search/tag_article
Disallow /feedback/
Disallow /magazines/page/
Disallow /phpadsnew/
Disallow /openx/
Disallow /directories-new/
Disallow /advertise/about-us-2021/
Disallow /wp/wp-admin/
Allow /wp/wp-admin/admin-ajax.php

Comments

  • Keep google news bot from certain areas
  • Block trendkite-akashic-crawler
  • currently meta robots noindexing
  • Disallow: /*?download$
  • Disallow: /forums/memberlist.php
  • Disallow: /research/research/success/