colearn.id
robots.txt

Robots Exclusion Standard data for colearn.id

Resource Scan

Scan Details

Site Domain colearn.id
Base Domain colearn.id
Scan Status Ok
Last Scan2024-09-16T16:09:38+00:00
Next Scan 2024-09-23T16:09:38+00:00

Last Scan

Scanned2024-09-16T16:09:38+00:00
URL https://colearn.id/robots.txt
Domain IPs 13.33.88.106, 13.33.88.107, 13.33.88.3, 13.33.88.56
Response IP 13.33.88.107
Found Yes
Hash 477734ee5935c1087d47c2e474c2c4a1b70dbf57518044e64c448a0eac3a6cda
SimHash 59409e40e6e0

Groups

*

Rule Path
Allow /
Disallow /user/verify-email?*
Disallow /.well-known/apple-app-site-association
Disallow /tanya/watch/*
Disallow /jadwal
Disallow /smartfest
Disallow /_next/data/*/tanya/*.json

gptbot

Rule Path
Allow /
Disallow /tanya/*
Disallow /user/verify-email?*
Disallow /.well-known/apple-app-site-association
Disallow /tanya/watch/*
Disallow /jadwal
Disallow /smartfest
Disallow /_next/data/*/tanya/*.json

yandex

Rule Path
Allow /
Disallow /user/verify-email?*
Disallow /.well-known/apple-app-site-association
Disallow /tanya/watch/*
Disallow /jadwal
Disallow /smartfest

linguee

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

seodiver

Rule Path
Disallow /

dataprovider

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

getintent crawler

Rule Path
Disallow /

twitterbot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

facebot

Rule Path
Disallow

Other Records

Field Value
sitemap https://colearn.id/sitemap.xml
sitemap https://colearn.id/tanya-sitemap.xml
sitemap https://colearn.id/sitemaps/chapters.xml
sitemap https://colearn.id/sitemaps/sections.xml
sitemap https://colearn.id/sitemaps/topics.xml
sitemap https://colearn.id/sitemaps/mengajar/sitemap.xml

Comments

  • *
  • GPTBot
  • Yandex
  • Linguee
  • SurdotlyBot
  • BUbiNG
  • SemrushBot-SA
  • SemrushBot
  • rogerbot
  • dotbot
  • BLEXBot
  • spbot
  • SEOdiver
  • dataprovider
  • magpie-crawler
  • GetIntent Crawler
  • Twitterbot
  • Mediapartners-Google
  • Facebot
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.