coursesidekick.com
robots.txt

Robots Exclusion Standard data for coursesidekick.com

Resource Scan

Scan Details

Site Domain coursesidekick.com
Base Domain coursesidekick.com
Scan Status Ok
Last Scan2025-12-12T18:22:24+00:00
Next Scan 2025-12-19T18:22:24+00:00

Last Scan

Scanned2025-12-12T18:22:24+00:00
URL https://coursesidekick.com/robots.txt
Redirect https://www.coursesidekick.com/robots.txt
Redirect Domain www.coursesidekick.com
Redirect Base coursesidekick.com
Domain IPs 104.18.43.134, 172.64.144.122
Redirect IPs 104.18.43.134, 172.64.144.122
Response IP 104.18.43.134
Found Yes
Hash 86becaee0872e87e1038cbee7e4b919a5fca2e9b3e9c4afff094cb393a828f35
SimHash 7eda880341a4

Groups

*

Rule Path
Disallow /api/
Disallow /file/
Disallow /tutors-problems/
Disallow /nity-would-Royalthy-Vnman-so-Fight-All-Ang-like-
Disallow /search/
Disallow /site/
Disallow /_Incapsula_Resource
Disallow /login?
Disallow /register?
Disallow /exclusive-doc/
Disallow /exclusive-qna/
Disallow /doc-asset/font/
Disallow */fonts.css

adsbot-google
adsbot-google-mobile

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /api/
Disallow /assets/
Disallow /doc-asset/

googleother
googleother-image
googleother-video

Rule Path
Disallow /

ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot-extended
bytespider
ccbot
claudebot
cloudvertexbot
cohere-training-data-crawler
cotoyogi
dataprovider.com
datenbank crawler
dcrawl
diffbot
facebookbot
friendlycrawler
google-extended
gptbot
helloworkjobpostingbot
httrack
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
indeedjobbot
isscyberriskcrawler
kangaroo bot
meta-externalagent
metainspector
netestate imprint crawler
newspaper
nutch
offline explorer
omgili
omgilibot
openindexspider
pangubot
potions
scrapy
serverhunterspider
statsdronebot
timpibot
turnitinbot
velenpublicwebcrawler
webzio-extended
yandex

Rule Path
Disallow /

Comments

  • Main rules
  • AdsBot-Google
  • GoogleOther
  • Other