zolltarifnummern.de
robots.txt

Robots Exclusion Standard data for zolltarifnummern.de

Resource Scan

Scan Details

Site Domain zolltarifnummern.de
Base Domain zolltarifnummern.de
Scan Status Ok
Last Scan2024-06-09T10:33:17+00:00
Next Scan 2024-06-16T10:33:17+00:00

Last Scan

Scanned2024-06-09T10:33:17+00:00
URL https://zolltarifnummern.de/robots.txt
Redirect https://www.zolltarifnummern.de/robots.txt
Redirect Domain www.zolltarifnummern.de
Redirect Base zolltarifnummern.de
Domain IPs 104.21.71.60, 172.67.169.252, 2606:4700:3032::6815:473c, 2606:4700:3035::ac43:a9fc
Redirect IPs 104.21.71.60, 172.67.169.252, 2606:4700:3032::6815:473c, 2606:4700:3035::ac43:a9fc
Response IP 104.21.71.60
Found Yes
Hash 9a23bceeb34f928c5e7ea5a748146eba49531e060309358da40649d584e406ab
SimHash c803413b47b6

Groups

*

Rule Path
Disallow /forum/*
Disallow /2013/*
Disallow /2014/*
Disallow /2015/*
Disallow /2016/*
Disallow /2017/*
Disallow /2018/*
Disallow /2019/*
Disallow /2020/*
Disallow /admin/*
Disallow /api/*
Disallow /register/*
Disallow /login/*
Disallow /logout/*
Disallow /profile/*
Disallow /news/*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

mojeek

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

turnitin

Rule Path
Disallow /

youbot

Rule Path
Disallow /

mazbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

proximic

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.zolltarifnummern.de/sitemap_de.xml
sitemap https://www.tariffnumber.com/sitemap_en.xml
sitemap https://www.tarifdouanier.eu/sitemap_fr.xml

Comments

  • Our content is made available under our terms and conditions of use.
  • Any other uses are not permitted, incl. but not limited to: for large language
  • models (LLMs), machine learning and/or artificial intelligence-related purposes
  • old datasets
  • auth
  • temporary
  • disallow various bots https://www.theguardian.com/robots.txt