lavoroecarriere.it
robots.txt

Robots Exclusion Standard data for lavoroecarriere.it

Resource Scan

Scan Details

Site Domain lavoroecarriere.it
Base Domain lavoroecarriere.it
Scan Status Ok
Last Scan2024-06-15T09:43:55+00:00
Next Scan 2024-06-22T09:43:55+00:00

Last Scan

Scanned2024-06-15T09:43:55+00:00
URL https://lavoroecarriere.it/robots.txt
Redirect https://www.lavoroecarriere.it/robots.txt
Redirect Domain www.lavoroecarriere.it
Redirect Base lavoroecarriere.it
Domain IPs 104.21.59.192, 172.67.182.237, 2606:4700:3030::6815:3bc0, 2606:4700:3031::ac43:b6ed
Redirect IPs 104.21.59.192, 172.67.182.237, 2606:4700:3030::6815:3bc0, 2606:4700:3031::ac43:b6ed
Response IP 104.21.59.192
Found Yes
Hash 5cefa6750274c3f521f91bbafc41b9b25d46d08ec2971f2d09f5cb7c0e21f72a
SimHash c8298a90c3f2

Groups

googlebot
*

Rule Path
Disallow /wp-admin/
Disallow /giornaleonline/
Disallow /app-test/

googlebot-news

Rule Path
Allow /ultima-ora/
Disallow /

Other Records

Field Value
sitemap http://www.lavoroecarriere.it/sitemap_index.xml
sitemap http://www.lavoroecarriere.it/lavoro-news-sitemap/sitemap-news.xml

Comments

  • Googlebot
  • Disallow: *.css
  • Disallow: *.js
  • Other bot spider
  • User-agent: *
  • Disallow: /giornaleonline/
  • Disallow: /app-test/
  • Disallow: /.js$*
  • Disallow: /.inc$*
  • Disallow: /.css$*
  • Disallow: /.php$*