tarifdouanier.eu
robots.txt

Robots Exclusion Standard data for tarifdouanier.eu

Resource Scan

Scan Details

Site Domain tarifdouanier.eu
Base Domain tarifdouanier.eu
Scan Status Ok
Last Scan2024-06-08T10:58:57+00:00
Next Scan 2024-06-15T10:58:57+00:00

Last Scan

Scanned2024-06-08T10:58:57+00:00
URL https://tarifdouanier.eu/robots.txt
Redirect https://www.tarifdouanier.eu/robots.txt
Redirect Domain www.tarifdouanier.eu
Redirect Base tarifdouanier.eu
Domain IPs 104.26.4.127, 104.26.5.127, 172.67.69.157, 2606:4700:20::681a:47f, 2606:4700:20::681a:57f, 2606:4700:20::ac43:459d
Redirect IPs 104.26.4.127, 104.26.5.127, 172.67.69.157, 2606:4700:20::681a:47f, 2606:4700:20::681a:57f, 2606:4700:20::ac43:459d
Response IP 172.67.69.157
Found Yes
Hash 9a23bceeb34f928c5e7ea5a748146eba49531e060309358da40649d584e406ab
SimHash c803413b47b6

Groups

*

Rule Path
Disallow /forum/*
Disallow /2013/*
Disallow /2014/*
Disallow /2015/*
Disallow /2016/*
Disallow /2017/*
Disallow /2018/*
Disallow /2019/*
Disallow /2020/*
Disallow /admin/*
Disallow /api/*
Disallow /register/*
Disallow /login/*
Disallow /logout/*
Disallow /profile/*
Disallow /news/*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

mojeek

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

turnitin

Rule Path
Disallow /

youbot

Rule Path
Disallow /

mazbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

proximic

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.zolltarifnummern.de/sitemap_de.xml
sitemap https://www.tariffnumber.com/sitemap_en.xml
sitemap https://www.tarifdouanier.eu/sitemap_fr.xml

Comments

  • Our content is made available under our terms and conditions of use.
  • Any other uses are not permitted, incl. but not limited to: for large language
  • models (LLMs), machine learning and/or artificial intelligence-related purposes
  • old datasets
  • auth
  • temporary
  • disallow various bots https://www.theguardian.com/robots.txt