inewi.pl
robots.txt

Robots Exclusion Standard data for inewi.pl

Resource Scan

Scan Details

Site Domain inewi.pl
Base Domain inewi.pl
Scan Status Ok
Last Scan2025-07-30T01:15:35+00:00
Next Scan 2025-08-29T01:15:35+00:00

Last Scan

Scanned2025-07-30T01:15:35+00:00
URL https://inewi.pl/robots.txt
Domain IPs 104.21.58.29, 172.67.155.75, 2606:4700:3032::6815:3a1d, 2606:4700:3035::ac43:9b4b
Response IP 172.67.155.75
Found Yes
Hash 1998c8ff600cb8ded1cda1c80000cea08c95815a297fc0805ad909f86b28d1cd
SimHash 030444034b31

Groups

*

Rule Path
Allow /
Disallow /admin
Disallow /Blog/post/edit
Disallow /animations/*
Disallow /cdn-cgi/speculation

fasterfox

Rule Path
Disallow /

nutch

Rule Path
Disallow /

spock

Rule Path
Disallow /

omniexplorer_bot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

becomebot

Rule Path
Disallow /

geniebot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://inewi.pl/sitemap.xml

Comments

  • *
  • Fasterfox
  • Nutch
  • spock
  • OmniExplorer_Bot
  • MJ12bot
  • TurnitinBot
  • BecomeBot
  • genieBot
  • MLBot
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.