msc.com.pl
robots.txt

Robots Exclusion Standard data for msc.com.pl

Resource Scan

Scan Details

Site Domain msc.com.pl
Base Domain msc.com.pl
Scan Status Ok
Last Scan2025-09-28T02:12:11+00:00
Next Scan 2025-10-28T02:12:11+00:00

Last Scan

Scanned2025-09-28T02:12:11+00:00
URL https://msc.com.pl/robots.txt
Domain IPs 217.168.143.15
Response IP 217.168.143.15
Found Yes
Hash 9e4aaaf56d7b728c5ac0605d331b41af25eb228886453185deebd77d59fb52d3
SimHash 98709f534f66

Groups

activeagent
emailsiphon
extractorpro

Rule Path
Disallow /

*

Rule Path
Disallow /_BIOSS
Disallow /_msctest08
Disallow /autobackup2
Disallow /cezar/m
Disallow /cezar1
Allow /cezar1/fots
Disallow /cezar22
Disallow /clock
Disallow /cron

Comments

  • Robots Exclusion file - this is used to disallow access
  • to webwalkers conforming to the defacto standard
  • Organisation: MSC.COM.PL
  • Webmaster: admin at msc.com.pl
  • Format is:
  • User-agent: <name of spider>
  • Disallow: <nothing> | <path>
  • -----------------------------------------------------------------------------
  • Updated: 2024-02-15
  • For some agents, disallow the whole web site:
  • User-agent: Googlebot-Mobile
  • For all user-agents, disallow private sub-systems: