premiadedalt.cat
robots.txt

Robots Exclusion Standard data for premiadedalt.cat

Resource Scan

Scan Details

Site Domain premiadedalt.cat
Base Domain premiadedalt.cat
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't establish SSL connection.
Last Scan2025-08-30T08:12:53+00:00
Next Scan 2025-11-28T08:12:53+00:00

Last Successful Scan

Scanned2025-04-10T05:14:41+00:00
URL https://premiadedalt.cat/robots.txt
Domain IPs 137.135.184.158
Response IP 137.135.184.158
Found Yes
Hash 8808359f5e99f3f4e03aab2434616dbbbe97cb6296467f14d65ac5aaa8cde12c
SimHash ac51ab554d65

Groups

*

Rule Path
Disallow

googlebot

Rule Path
Disallow /*?
Disallow /*atct_album_view$
Disallow /*folder_factories$
Disallow /*folder_summary_view$
Disallow /*login_form$
Disallow /*mail_password_form$
Disallow /*search
Disallow /*search_rss$
Disallow /*sendto_form$
Disallow /*summary_view$
Disallow /*thumbnail_view$
Disallow /*view$
Disallow /home-page/
Disallow /fitxers/
Disallow /home-page/

Other Records

Field Value
sitemap https://www.premiadedalt.cat/sitemap.xml.gz

Comments

  • Define access-restrictions for robots/spiders
  • http://www.robotstxt.org/wc/norobots.html
  • By default we allow robots to access all areas of our site
  • already accessible to anonymous users
  • Add Googlebot-specific syntax extension to exclude forms
  • that are repeated for each piece of content in the site
  • the wildcard is only supported by Googlebot
  • http://www.google.com/support/webmasters/bin/answer.py?answer=40367&ctx=sibling