tariffnumber.com
robots.txt

Robots Exclusion Standard data for tariffnumber.com

Resource Scan

Scan Details

Site Domain tariffnumber.com
Base Domain tariffnumber.com
Scan Status Ok
Last Scan2024-05-31T15:39:44+00:00
Next Scan 2024-06-07T15:39:44+00:00

Last Scan

Scanned2024-05-31T15:39:44+00:00
URL https://tariffnumber.com/robots.txt
Redirect https://www.tariffnumber.com/robots.txt
Redirect Domain www.tariffnumber.com
Redirect Base tariffnumber.com
Domain IPs 104.21.69.156, 172.67.209.165, 2606:4700:3034::6815:459c, 2606:4700:3034::ac43:d1a5
Redirect IPs 104.21.69.156, 172.67.209.165, 2606:4700:3034::6815:459c, 2606:4700:3034::ac43:d1a5
Response IP 172.67.209.165
Found Yes
Hash 9a23bceeb34f928c5e7ea5a748146eba49531e060309358da40649d584e406ab
SimHash c803413b47b6

Groups

*

Rule Path
Disallow /forum/*
Disallow /2013/*
Disallow /2014/*
Disallow /2015/*
Disallow /2016/*
Disallow /2017/*
Disallow /2018/*
Disallow /2019/*
Disallow /2020/*
Disallow /admin/*
Disallow /api/*
Disallow /register/*
Disallow /login/*
Disallow /logout/*
Disallow /profile/*
Disallow /news/*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

mojeek

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

turnitin

Rule Path
Disallow /

youbot

Rule Path
Disallow /

mazbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

proximic

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.zolltarifnummern.de/sitemap_de.xml
sitemap https://www.tariffnumber.com/sitemap_en.xml
sitemap https://www.tarifdouanier.eu/sitemap_fr.xml

Comments

  • Our content is made available under our terms and conditions of use.
  • Any other uses are not permitted, incl. but not limited to: for large language
  • models (LLMs), machine learning and/or artificial intelligence-related purposes
  • old datasets
  • auth
  • temporary
  • disallow various bots https://www.theguardian.com/robots.txt