warsaw.cc
robots.txt

Robots Exclusion Standard data for warsaw.cc

Resource Scan

Scan Details

Site Domain warsaw.cc
Base Domain warsaw.cc
Scan Status Ok
Last Scan2025-07-04T02:54:13+00:00
Next Scan 2025-08-03T02:54:13+00:00

Last Scan

Scanned2025-07-04T02:54:13+00:00
URL https://warsaw.cc/robots.txt
Domain IPs 104.26.2.65, 104.26.3.65, 172.67.73.135, 2606:4700:20::681a:241, 2606:4700:20::681a:341, 2606:4700:20::ac43:4987
Response IP 104.26.3.65
Found Yes
Hash 35dbd7c072474d49e6e8fdbfc32e4ee4ce348e83aa025ac385f16d8ffe8f0002
SimHash 6918ddc267b7

Groups

*

Rule Path
Disallow /calendar/action*
Disallow /events/action*
Allow /*.css
Allow /*.js
Disallow /*?

Other Records

Field Value
crawl-delay 3

claudebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Comments

  • Default Flywheel robots file
  • ClaudeBot-specific rules