linhaca.net.br
robots.txt

Robots Exclusion Standard data for linhaca.net.br

Resource Scan

Scan Details

Site Domain linhaca.net.br
Base Domain linhaca.net.br
Scan Status Ok
Last Scan 2025-11-17T02:38:22+00:00
Next Scan 2025-11-24T02:38:22+00:00

Last Scan

Scanned 2025-11-17T02:38:22+00:00
URL https://linhaca.net.br/robots.txt
Redirect https://www.linhaca.net.br/robots.txt
Redirect Domain www.linhaca.net.br
Redirect Base linhaca.net.br
Domain IPs 104.21.5.137, 172.67.133.128, 2606:4700:3031::ac43:8580, 2606:4700:3033::6815:589
Redirect IPs 104.21.5.137, 172.67.133.128, 2606:4700:3031::ac43:8580, 2606:4700:3033::6815:589
Response IP 104.21.5.137
Found Yes
Hash 08dcb6f0112481bb0e9bea6d6a01cb1f05d584833d1b421b72564225e9c0ebaa
SimHash 2954139207e2
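
The Hash field above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. A minimal sketch of how such a fingerprint could be used to detect changes between scans, assuming SHA-256 over the raw response body (the function name and the sample body are hypothetical):

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256. The scanner's actual algorithm is not stated.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file from linhaca.net.br.
sample = b"User-agent: *\nAllow: /\n"
digest = content_fingerprint(sample)

# Any byte-level change to the file produces a different digest,
# so comparing digests between scans reveals whether the file changed.
changed = content_fingerprint(sample + b"Disallow: /tmp\n") != digest
```

An exact hash like this only says whether the file changed at all; a similarity hash such as the SimHash field is the kind of value that would be used to judge how much it changed.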

Groups

*

Rule Path
Allow /
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/cache
Disallow /wp-content/plugins
Disallow /wp-content/upgrades
Disallow /wp-login
Disallow /trackback
Disallow /comments
Disallow /author
Allow /wp-content/uploads
Allow /thumbs-carros
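
The `*` group mixes a blanket `Allow /` with narrower `Disallow` paths, so the outcome for a given URL depends on how a crawler resolves overlapping rules. A minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and `Allow` wins a tie); `RULES` and `is_allowed` are illustrative names, and individual crawlers may behave differently:

```python
# Rules from the "*" group above, as (directive, path prefix) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-admin"),
    ("disallow", "/wp-includes"),
    ("disallow", "/wp-content/cache"),
    ("disallow", "/wp-content/plugins"),
    ("disallow", "/wp-content/upgrades"),
    ("disallow", "/wp-login"),
    ("disallow", "/trackback"),
    ("disallow", "/comments"),
    ("disallow", "/author"),
    ("allow", "/wp-content/uploads"),
    ("allow", "/thumbs-carros"),
]


def is_allowed(path, rules=RULES):
    """Apply longest-prefix precedence; a path with no matching rule is allowed."""
    best_len = -1
    best_allow = True
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = directive == "allow"
        # A longer prefix is more specific; on equal length, allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow
```

Under this reading, `is_allowed("/wp-admin/index.php")` is false because `/wp-admin` is longer than `/`, while `is_allowed("/wp-content/uploads/a.jpg")` is true. Because matching is by prefix, `/author` also covers paths such as `/authors`.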

googlebot*

Rule Path
Disallow /*.php$
Disallow /*.inc$
Disallow /*.cgi$
Disallow /*.xhtml$
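
The paths in this group use two special characters: `*` matches any sequence of characters and a trailing `$` anchors the pattern to the end of the URL path. A minimal sketch of that matching, translating each pattern into a regular expression (`pattern_to_regex` and `is_blocked` are hypothetical helpers, not part of any crawler's API):

```python
import re

# Disallow patterns from the googlebot group above.
GOOGLEBOT_DISALLOW = ["/*.php$", "/*.inc$", "/*.cgi$", "/*.xhtml$"]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    # Without "$" the pattern is a plain prefix match, so nothing is appended.
    return re.compile(body + (r"\Z" if anchored else ""))


def is_blocked(path):
    """True if the path matches any of the group's Disallow patterns."""
    # re.match anchors at the start of the path, as robots.txt rules do.
    return any(pattern_to_regex(p).match(path) for p in GOOGLEBOT_DISALLOW)
```

With these patterns, `/index.php` is blocked but `/index.php?x=1` is not, because the `$` anchor requires the URL to end in the extension.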

Other Records

Field Value
sitemap https://www.linhaca.net.br/sitemap.xml
sitemap https://www.linhaca.net.br/sitemap.xml.gz

Comments

  • robots.txt file for https://www.linhaca.net.br/
  • Template version: 20171218
  • Last update of this robots.txt file: 12/09/23 at 04:10
  • Avoid indexing some directories
  • Allow others
  • Avoid indexing somes file extensions
  • Sitemap
  • 56e8a116e6e4e81251a993c87b100cdbf9b174c6f0762d03b4f2fe93b075666d