netwerk24.com
robots.txt

Robots Exclusion Standard data for netwerk24.com

Resource Scan

Scan Details

Site Domain netwerk24.com
Base Domain netwerk24.com
Scan Status Ok
Last Scan2024-06-26T22:08:18+00:00
Next Scan 2024-07-03T22:08:18+00:00

Last Scan

Scanned2024-06-26T22:08:18+00:00
URL https://netwerk24.com/robots.txt
Redirect https://www.netwerk24.com/robots.txt
Redirect Domain www.netwerk24.com
Redirect Base netwerk24.com
Domain IPs 104.19.231.81, 104.19.232.81, 2606:4700::6813:e751, 2606:4700::6813:e851
Redirect IPs 104.19.231.81, 104.19.232.81, 2606:4700::6813:e751, 2606:4700::6813:e851
Response IP 104.19.232.81
Found Yes
Hash 7e337c2a7614102e55c36230051f4799e2943f8cbe26a2da09ae862543965033
SimHash 741c5951edb1

Groups

*

Rule Path
Allow /
Allow */cricket/test/
Allow */cricketworldcup2019/Test/
Disallow /toets/
Disallow */toets/*
Disallow /test/
Disallow */_test/*
Disallow */testpolar/*
Disallow */test/*
Disallow /xArchive/Archive/Illegal-liquor-export-20010319
Disallow /.well-known/
Disallow /assetlinks.json

twitterbot

Rule Path
Allow /

ia_archiver

Rule Path
Disallow /BreakingNewsSms

mauibot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

dataprovider-com

Rule Path
Disallow /

dcrawl

Rule Path
Disallow /

httrack

Rule Path
Disallow /

httrack-3-0

Rule Path
Disallow /

metainspector

Rule Path
Disallow /

newspaper

Rule Path
Disallow /

nutch

Rule Path
Disallow /

offline-explorer

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

domainstatsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

hypestat

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

screaming-frog-seo-spider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

zoombot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.netwerk24.com/sitemap

Comments

  • AI Assistants
  • AI Data Scrapers
  • AI Search Crawlers
  • Scrapers
  • SEO Crawlers
  • Undocumented AI Agents

Warnings

  • 2 invalid lines.