cancertherapyadvisor.com
robots.txt

Robots Exclusion Standard data for cancertherapyadvisor.com

Resource Scan

Scan Details

Site Domain cancertherapyadvisor.com
Base Domain cancertherapyadvisor.com
Scan Status Ok
Last Scan2024-09-18T14:23:03+00:00
Next Scan 2024-09-25T14:23:03+00:00

Last Scan

Scanned2024-09-18T14:23:03+00:00
URL https://cancertherapyadvisor.com/robots.txt
Redirect https://www.cancertherapyadvisor.com:443/robots.txt
Redirect Domain www.cancertherapyadvisor.com
Redirect Base cancertherapyadvisor.com
Domain IPs 3.213.135.71, 34.192.100.88, 34.204.64.214
Redirect IPs 104.18.30.241, 104.18.31.241, 2606:4700::6812:1ef1, 2606:4700::6812:1ff1
Response IP 104.18.31.241
Found Yes
Hash 37f42f8fa40e42bfdecc56455b130a26d7a6a17e10543847797ae70316d14eaa
SimHash 62aa5f26c576

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /?s=
Disallow /*%26s%3D
Disallow /search/
Disallow /register/
Disallow /letter/*
Disallow */email/
Disallow */emailArticle/
Disallow */emailreview/
Disallow */printreview/
Disallow */emailGroupTest/
Disallow */printGroupTest/
Disallow /AdZone/InitAjax
Disallow /accountmedical/
Disallow */Web%20Support/
Disallow /article/articletrack/*
Disallow /home/page/*
Disallow /home/archive/*

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

mj12bot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.empr.com/hm-drug-sitemap1.xml
sitemap https://www.empr.com/hm-drug-sitemap2.xml
sitemap https://www.empr.com/hm-drug-sitemap3.xml
sitemap https://www.empr.com/hm-drug-sitemap4.xml
sitemap https://www.empr.com/hm-drug-sitemap5.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap1.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap2.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap3.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap4.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap5.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap6.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap7.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap8.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap9.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap10.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap11.xml
sitemap https://www.cancertherapyadvisor.com/post-sitemap12.xml
sitemap https://www.cancertherapyadvisor.com/page-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/clinicianpov-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/counselingconnection-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/diagnosticupdate-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/drugprimer-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/howtotreat-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/downloadingthedata-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/meetinginsight-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/hm-section-front-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/hm-slideshow-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/hm-quiz-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/hm-playlist-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/hm-company-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/category-sitemap.xml
sitemap https://www.cancertherapyadvisor.com/authors-sitemap1.xml
sitemap https://www.cancertherapyadvisor.com/authors-sitemap2.xml
sitemap https://www.cancertherapyadvisor.com/authors-sitemap3.xml
sitemap https://www.cancertherapyadvisor.com/authors-sitemap4.xml
sitemap https://www.cancertherapyadvisor.com/news-sitemap.xml

Comments

  • Robots.txt - created by the Virtual Robots.txt WordPress plugin.
  • Source: https://gitlab.com/mikelking/virtual-robots-txt
  • Plugin: https://www.wordpress.org/plugins/virtual-robots-txt/
  • Specifications https://developers.google.com/search/reference/robots_txt
  • VALIDATOR: https://technicalseo.com/seo-tools/robots-txt/
  • Adding disallows for AI bots and users 10/2023
  • Block OpenAI
  • Block Google Bard AI
  • Block Common Crawl
  • Author: Mikel King @ Olivent.com

Warnings

  • 1 invalid line.