thecollegeinvestor.com
robots.txt

Robots Exclusion Standard data for thecollegeinvestor.com

Resource Scan

Scan Details

Site Domain thecollegeinvestor.com
Base Domain thecollegeinvestor.com
Scan Status Ok
Last Scan2024-11-13T10:16:15+00:00
Next Scan 2024-11-20T10:16:15+00:00

Last Scan

Scanned2024-11-13T10:16:15+00:00
URL https://thecollegeinvestor.com/robots.txt
Domain IPs 162.159.134.42
Response IP 162.159.134.42
Found Yes
Hash 2a363b102c180064f1a6271f0999e627e4437200c93823da6b735e4268a437e6
SimHash fc49111f2731

Groups

*

Rule Path
Disallow /tags/
Disallow /feed/
Disallow /go/

anthropic-ai

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-cloudvertexbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

meta-externalagent
meta-externalagent

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

quora-bot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://thecollegeinvestor.com/sitemap_index.xml
sitemap https://thecollegeinvestor.com/author-sitemap.xml
sitemap https://thecollegeinvestor.com/news-sitemap.xml

Comments

  • The College Investor content is made available for your personal, non-commercial
  • use subject to our Terms of Use here:
  • https://thecollegeinvestor.com/terms-of-service/.
  • Use of any device, tool, or process designed to data mine or scrape the content
  • using automated means is prohibited without prior written permission from
  • The College Investor LLC. Prohibited uses include but are not limited to:
  • (1) text and data mining activities under Art. 4 of the EU Directive on Copyright in
  • the Digital Single Market;
  • (2) the development of any software, machine learning, artificial intelligence (AI),
  • and/or large language models (LLMs);
  • (3) creating or providing archived or cached data sets containing our content to others; and/or
  • (4) any commercial purposes.
  • Disallow Rules
  • Sitemaps