launchx431.fr
robots.txt

Robots Exclusion Standard data for launchx431.fr

Resource Scan

Scan Details

Site Domain launchx431.fr
Base Domain launchx431.fr
Scan Status Ok
Last Scan2025-12-08T21:13:08+00:00
Next Scan 2026-01-07T21:13:08+00:00

Last Scan

Scanned2025-12-08T21:13:08+00:00
URL https://launchx431.fr/robots.txt
Redirect https://www.LaunchX431.fr/robots.txt
Redirect Domain www.launchx431.fr
Redirect Base launchx431.fr
Domain IPs 72.167.143.207
Redirect IPs 72.167.143.207
Response IP 72.167.143.207
Found Yes
Hash e4ae5da552bd9d2298235bfa8f7d44dc6738f56045fe673d9151d2b03521d93c
SimHash 7e5519136783

Groups

ahrefsbot
dotbot
mj12bot
semrushbot
blexbot
cloudrobo
barkrowler
censysinspect

Rule Path
Disallow /

bingbot

Rule Path
Disallow /search/
Disallow /service/?
Disallow /support/?

baiduspider

Rule Path
Disallow /search/
Disallow /service/?
Disallow /support/?
Disallow /*?*

sogou web spider
sogou inst spider
sogou spider2
sogou blog
sogou news spider
sogou orion spider

Rule Path
Disallow /

chinasospider
youdaobot
sosospider
yisouspider
easouspider

Rule Path
Disallow /

gptbot
chatgpt-user
claudebot
claude-user
claude-searchbot
ccbot
applebot-extended
facebookbot
meta-externalagent
meta-externalfetcher
diffbot
perplexitybot
perplexity‑user
omgili
omgilibot
webzio-extended
imagesiftbot
bytespider
tiktokspider
amazonbot
youbot
semrushbot-ocob
petalbot
velenpublicwebcrawler
turnitinbot
timpibot
oai-searchbot
icc-crawler
ai2bot
ai2bot-dolma
dataforseobot
awariobot
awariosmartbot
awariorssbot
pangubot
kangaroo bot
sentibot
img2dataset
meltwater
seekr
peer39_crawler
cohere-ai
cohere-training-data-crawler
duckassistbot
scrapy
cotoyogi
aihitbot
factset_spyderbot
firecrawlagent
thinkbot
aliyunsecbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

*

Rule Path
Disallow /ajax/
Disallow /api_llpay/
Disallow /api_ppec/
Disallow /api_pprest/
Disallow /app/
Disallow /inc/
Disallow /js/
Disallow /logs/
Disallow /members/
Disallow /plugins/
Disallow /up/
Disallow /vendors/
Disallow /search/*ListPager*
Disallow /service/*ListPager*
Disallow /support/*ListPager*

Other Records

Field Value
sitemap https://www.LaunchX431.fr/xml/sitemap.xml

Comments

  • Block Malicious Crawlers Spiders (Important!)
  • Block certain search engine spiders or restrict specific content from a single spider
  • Block all known AI crawlers and assistants from using content for training AI models.
  • User-Agent: Google-Extended
  • User-Agent: Google-CloudVertexBot

Warnings

  • 1 invalid line.
  • `content-usage` is not a known field.
  • `disallowaitraining` is not a known field.