etudes-litteraires.com
robots.txt

Robots Exclusion Standard data for etudes-litteraires.com

Resource Scan

Scan Details

Site Domain etudes-litteraires.com
Base Domain etudes-litteraires.com
Scan Status Ok
Last Scan2024-11-05T04:30:07+00:00
Next Scan 2024-11-12T04:30:07+00:00

Last Scan

Scanned2024-11-05T04:30:07+00:00
URL https://etudes-litteraires.com/robots.txt
Domain IPs 2a02:4780:84:59f8:1d05:f1e4:15db:98b1, 84.32.84.58
Response IP 77.37.66.110
Found Yes
Hash ea6829df74355b9cd62c02ad1736434627c5ea41e127c47294fd0c4fde8edf57
SimHash 521c89538891

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /_img/contact/
Disallow /*?amp$
Disallow /*.rss
Disallow /*?PageSpeed=noscript
Disallow /*?iframe=true
Disallow /forum/?q=*
Disallow /forum/u/

ahrefsbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

dubbotbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

feedfetcher-google

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

omgili

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

senutobot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.etudes-litteraires.com/page-sitemap.xml