uptain.de
robots.txt

Robots Exclusion Standard data for uptain.de

Resource Scan

Scan Details

Site Domain uptain.de
Base Domain uptain.de
Scan Status Ok
Last Scan2026-02-11T01:24:34+00:00
Next Scan 2026-03-13T01:24:34+00:00

Last Scan

Scanned2026-02-11T01:24:34+00:00
URL https://uptain.de/robots.txt
Domain IPs 104.26.4.84, 104.26.5.84, 172.67.72.77, 2606:4700:20::681a:454, 2606:4700:20::681a:554, 2606:4700:20::ac43:484d
Response IP 172.67.72.77
Found Yes
Hash cf6bf6f68db07e97d002405453b0679103af9c4f00b7bef1addbe9db3692583d
SimHash 73b6c30175c2

Groups

*

Rule Path
Disallow /page/
Disallow /category/
Disallow /tag/

ai2bot

Rule Path
Disallow

amazonbot

Rule Path
Disallow

claudebot

Rule Path
Disallow

claude-user

Rule Path
Disallow

applebot

Rule Path
Disallow

applebot-extended

Rule Path
Disallow

bingbot

Rule Path
Disallow

bytespider

Rule Path
Disallow

ccbot

Rule Path
Disallow

gptbot

Rule Path
Disallow

chatgpt-user

Rule Path
Disallow

oai-searchbot

Rule Path
Disallow

cohere-ai

Rule Path
Disallow

diffbot

Rule Path
Disallow

duckassistbot

Rule Path
Disallow

facebookbot

Rule Path
Disallow

meta-externalagent

Rule Path
Disallow

google-extended

Rule Path
Disallow

linkedinbot

Rule Path
Disallow

omgili

Rule Path
Disallow

perplexitybot

Rule Path
Disallow

timpibot

Rule Path
Disallow

youbot

Rule Path
Disallow

Comments

  • General
  • Allen Institute
  • Amazon
  • Anthropic
  • Apple
  • Microsoft
  • ByteDance
  • Common Crawl
  • OpenAI
  • Cohere
  • Diffbot
  • DuckDuckGo
  • Meta
  • Google
  • LinkedIn
  • Omgili
  • Perplexity
  • Timpi
  • You.com