diaridebalears.com
robots.txt

Robots Exclusion Standard data for diaridebalears.com

Resource Scan

Scan Details

Site Domain diaridebalears.com
Base Domain diaridebalears.com
Scan Status Ok
Last Scan2024-11-16T18:22:07+00:00
Next Scan 2024-11-23T18:22:07+00:00

Last Scan

Scanned2024-11-16T18:22:07+00:00
URL https://diaridebalears.com/robots.txt
Redirect https://www.dbalears.cat/robots.txt
Redirect Domain www.dbalears.cat
Redirect Base dbalears.cat
Domain IPs 194.224.110.188
Redirect IPs 194.224.110.188
Response IP 194.224.110.188
Found Yes
Hash aa3adbc121fc9a7d77dd71ed7145fac08dc631bead3aecc7c299553fd043f07c
SimHash 7b285fb2c094

Groups

*

Rule Path
Disallow /frontend_dev.php/
Disallow /frontend_nocache.php/
Disallow /backend.php/
Disallow /backend_dev.php/
Disallow /cron.php/
Disallow /cron_dev.php/
Disallow /sfCounter/*
Disallow /sfRating/*

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

Other Records

Field Value
sitemap https://www.wdbalears.cat/sitemap.xml
sitemap https://www.dbalears.cat/googlenews.xml
sitemap https://www.dbalears.cat/image-sitemap.xml
sitemap https://www.dbalears.cat/autors.xml
sitemap https://www.dbalears.cat/noticies/sitemapIndex.xml

Comments

  • Yahoo's Slurp Robot - Please wait 15 seconds in between visits
  • MSN Robot - Please wait 15 seconds in between visits
  • Generales
  • Autores
  • Hemeroteca

Warnings

  • 2 invalid lines.