midnight-commander.org
robots.txt

Robots Exclusion Standard data for midnight-commander.org

Resource Scan

Scan Details

Site Domain midnight-commander.org
Base Domain midnight-commander.org
Scan Status Ok
Last Scan2024-10-04T02:39:16+00:00
Next Scan 2024-10-11T02:39:16+00:00

Last Scan

Scanned2024-10-04T02:39:16+00:00
URL https://midnight-commander.org/robots.txt
Domain IPs 140.211.15.12
Response IP 140.211.15.12
Found Yes
Hash 18f647582703e95e4f3b8c283115bac5c69edda13a4d8a325c664aade530d612
SimHash 621ec873c6e0

Groups

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

*

Rule Path
Disallow /browser
Disallow /changeset
Disallow /diff
Disallow /downloads/
Disallow /query
Disallow /log
Disallow /timeline?

Other Records

Field Value
crawl-delay 60

Comments

  • Ban rogue search engines
  • we allow the 'timeline' and 'downloads' pages itself,
  • but not with params.