neuromatch.social
robots.txt

Robots Exclusion Standard data for neuromatch.social

Resource Scan

Scan Details

Site Domain neuromatch.social
Base Domain neuromatch.social
Scan Status Ok
Last Scan2025-10-27T15:49:04+00:00
Next Scan 2025-10-28T15:49:04+00:00

Last Scan

Scanned2025-10-27T15:49:04+00:00
URL https://neuromatch.social/robots.txt
Domain IPs 97.107.137.53
Response IP 97.107.137.53
Found Yes
Hash f18b57dfe5177d4b7adca891cef87fe53b68ab4eda485636ea589fbd4d4e61e2
SimHash 70699b59ccc2

Groups

*

Rule Path
Disallow /media_proxy/
Disallow /interact/
Disallow /api/v1/instance/domain_blocks

ai2bot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

cohere-training-data-crawler

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

duckassistbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

kangaroo bot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

pangubot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Comments

  • .__---~~~(~~-_.
  • _-' ) -~~- ) _-" )_
  • ( ( `-,_..`.,_--_ '_,)_
  • ( -_) ( -_-~ -_ `, )
  • (_ -_ _-~-__-~`, ,' )__-'))--___--~~~--__--~~--___--__..
  • _ ~`_-'( (____;--==,,_))))--___--~~~--__--~~--__----~~~'`=__-~+_-_.
  • (@) (@) ````` `-_(())_-~
  • ,---. .=-.-..-._ ,-,--.
  • _..---. .-.,.---. .--.' \ /==/_ /==/ \ .-._ ,-.'- _\
  • .' .'.-. \ /==/ ` \ \==\-/\ \ |==|, ||==|, \/ /, /==/_ ,_.'
  • /==/- '=' /|==|-, .=., |/==/-|_\ | |==| ||==|- \| |\==\ \
  • |==|-, ' |==| '=' /\==\, - \ |==|- ||==| , | -| \==\ -\
  • |==| .=. \|==|- , .' /==/ - ,| |==| ,||==| - _ | _\==\ ,\
  • /==/- '=' ,|==|_ . ,'./==/- /\ - \|==|- ||==| /\ , |/==/\/ _ |
  • |==| - //==/ /\ , )==\ _.\=\.-'/==/. //==/, | |- |\==\ - , /
  • `-._`.___,' `--`-`--`--' `--` `--`-` `--`./ `--` `--`---'
  • AI Data Scraper
  • https://darkvisitors.com/agents/ai2bot
  • AI Search Crawler
  • https://darkvisitors.com/agents/amazonbot
  • Undocumented AI Agent
  • https://darkvisitors.com/agents/anthropic-ai
  • AI Search Crawler
  • https://darkvisitors.com/agents/applebot
  • AI Data Scraper
  • https://darkvisitors.com/agents/applebot-extended
  • AI Data Scraper
  • https://darkvisitors.com/agents/bytespider
  • AI Data Scraper
  • https://darkvisitors.com/agents/ccbot
  • AI Assistant
  • https://darkvisitors.com/agents/chatgpt-user
  • Undocumented AI Agent
  • https://darkvisitors.com/agents/claude-web
  • AI Data Scraper
  • https://darkvisitors.com/agents/claudebot
  • Undocumented AI Agent
  • https://darkvisitors.com/agents/cohere-ai
  • AI Data Scraper
  • https://darkvisitors.com/agents/cohere-training-data-crawler
  • AI Data Scraper
  • https://darkvisitors.com/agents/diffbot
  • AI Assistant
  • https://darkvisitors.com/agents/duckassistbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/facebookbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/google-extended
  • AI Data Scraper
  • https://darkvisitors.com/agents/gptbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/kangaroo-bot
  • AI Data Scraper
  • https://darkvisitors.com/agents/meta-externalagent
  • AI Assistant
  • https://darkvisitors.com/agents/meta-externalfetcher
  • AI Search Crawler
  • https://darkvisitors.com/agents/oai-searchbot
  • AI Data Scraper
  • https://darkvisitors.com/agents/omgili
  • AI Data Scraper
  • https://darkvisitors.com/agents/pangubot
  • AI Search Crawler
  • https://darkvisitors.com/agents/perplexitybot
  • AI Data Scraper
  • https://darkvisitors.com/agents/timpibot
  • AI Data Scraper
  • https://darkvisitors.com/agents/webzio-extended
  • AI Search Crawler
  • https://darkvisitors.com/agents/youbot