korii.slate.fr
robots.txt

Robots Exclusion Standard data for korii.slate.fr

Resource Scan

Scan Details

Site Domain korii.slate.fr
Base Domain slate.fr
Scan Status Ok
Last Scan 2024-10-31T12:41:38+00:00
Next Scan 2024-11-30T12:41:38+00:00

Last Scan

Scanned 2024-10-31T12:41:38+00:00
URL https://korii.slate.fr/robots.txt
Domain IPs 104.22.46.201, 104.22.47.201, 172.67.9.244, 2606:4700:10::6816:2ec9, 2606:4700:10::6816:2fc9, 2606:4700:10::ac43:9f4
Response IP 104.22.47.201
Found Yes
Hash e5955e8cc1d1271f9f4873d35d54c157a96b6153ecfe7e665a8aac6b4e8318e4
SimHash f41710526605

Groups

*

Rule Path
Disallow /assets/
Disallow /board/
Disallow /channels/
Disallow /components/
Disallow /controllers/
Disallow /formatters/
Disallow /helpers/
Disallow /javascript/
Disallow /jobs/
Disallow /mailers/
Disallow /models/
Disallow /policies/
Disallow /services/
Disallow /uploads/
Disallow /uploaders/
Disallow /views/
Disallow /workers/
Allow /assets/*.css$
Allow /assets/*.css?
Allow /assets/*.js$
Allow /assets/*.js?
Allow /assets/*.gif
Allow /assets/*.jpg
Allow /assets/*.jpeg
Allow /assets/*.png
Allow /assets/*.woff$
Allow /assets/*.woff2$
Allow /assets/*.ttf$
Allow /assets/*.eot$
Allow /assets/*.svg$
Allow /uploads/store/
Disallow /api/
Disallow /audio/board/
Disallow /blognews/
Disallow /BLOGS_CHRONIQUES/
Disallow /cdn-cgi/
Disallow /embed/
Disallow /plus/
Disallow /reader/
Disallow /slate_amazon/
Disallow /slate_podcasts/
Disallow /slate_theme_serie/
Disallow /slate_views_topics/
Disallow /widgets/
Disallow /21685585667/
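
The `*` group above mixes broad Disallow prefixes with narrower Allow patterns that use `*` and `$`. As a rough illustration of how such rules can resolve, the sketch below applies the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). It uses only a subset of the rules listed above; the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers, and whether any particular crawler evaluates the file this way is an assumption.

```python
import re

# A subset of the rules listed above for the `*` group.
RULES = [
    ("disallow", "/assets/"),
    ("disallow", "/uploads/"),
    ("allow", "/assets/*.css$"),
    ("allow", "/assets/*.png"),
    ("allow", "/uploads/store/"),
    ("disallow", "/api/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    `*` matches any run of characters; a trailing `$` anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            allow = kind == "allow"
            if length > best_len or (length == best_len and allow):
                best_len, best_allow = length, allow
    return best_allow
```

Under these assumptions, `is_allowed("/assets/app.css")` is true because `/assets/*.css$` is longer than `/assets/`, while `is_allowed("/assets/app.css.map")` is false because the `$` anchor prevents the Allow pattern from matching.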

Other Records

Field Value
crawl-delay 10

googlebot-news
dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 900

orangebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 900
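
The crawl-delay records above (10 for `*`, 900 for googlebot-news, dotbot, and orangebot) are not part of RFC 9309, and crawlers differ in whether and how they honor them. The sketch below assumes the common reading of the value as a number of seconds between requests, and it matches user agents by exact lowercase name, which is a simplification. `CRAWL_DELAYS`, `delay_for`, and `seconds_to_wait` are hypothetical names used only for illustration.

```python
# Values copied from the crawl-delay records in this report.
CRAWL_DELAYS = {"*": 10, "googlebot-news": 900, "dotbot": 900, "orangebot": 900}


def delay_for(user_agent, delays=CRAWL_DELAYS):
    """Return the delay for the agent's own group, else fall back to `*`."""
    return delays.get(user_agent.lower(), delays["*"])


def seconds_to_wait(user_agent, last_fetch, now):
    """Seconds left before the next request is due (0.0 if already due).

    `last_fetch` and `now` are timestamps in seconds on the same clock.
    """
    return max(0.0, last_fetch + delay_for(user_agent) - now)
```

For example, a crawler identifying as dotbot that last fetched a page 300 seconds ago would, under this reading, wait another 600 seconds before its next request.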

adequat

Rule Path
Disallow /

adequat-systems

Rule Path
Disallow /

amisoftware

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

ask n read

Rule Path
Disallow /

asknread.com

Rule Path
Disallow /

augure

Rule Path
Disallow /

auramundi

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

cision

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

coexel

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

converacrawler

Rule Path
Disallow /

corporama

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

digimind

Rule Path
Disallow /

ellisphere

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

fast

Rule Path
Disallow /

fetch

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

infoseek

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

kbcrawl

Rule Path
Disallow /

knowings

Rule Path
Disallow /

leadbox

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

mention

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

moreover

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

mytwip

Rule Path
Disallow /

newzbin

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

opinion-tracker

Rule Path
Disallow /

peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

proxem

Rule Path
Disallow /

qwam content intelligence

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

score3

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

sindup

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

talkwater

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

trendeo

Rule Path
Disallow /

trendybuzz

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

vecteurplus

Rule Path
Disallow /

verticalsearch

Rule Path
Disallow /

vsw

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

wget

Rule Path
Disallow /

winello

Rule Path
Disallow /

youbot

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://korii.slate.fr/sitemap.xml
sitemap https://korii.slate.fr/googlenews.xml

Comments

  • Directories
  • CSS, JS, Images
  • V4
  • Excluded robots

Warnings

  • 4 invalid lines.
  • `host` is not a known field.