fshark.com
robots.txt

Robots Exclusion Standard data for fshark.com

Resource Scan

Scan Details

Site Domain fshark.com
Base Domain fshark.com
Scan Status Ok
Last Scan 2025-09-26T15:13:11+00:00
Next Scan 2025-10-26T15:13:11+00:00

Last Scan

Scanned 2025-09-26T15:13:11+00:00
URL https://fshark.com/robots.txt
Domain IPs 104.21.55.59, 172.67.145.89, 2606:4700:3034::6815:373b, 2606:4700:3036::ac43:9159
Response IP 104.21.55.59
Found Yes
Hash 382f22583b7c57849e3cf598a8ed3c8b365860af11303f821fce83650b12d3cf
SimHash 254359d3fdf4

Groups

*

Rule Path
Disallow /mz
Disallow /js
Disallow /xajax_js

*

Rule Path
Disallow /blog/cgi-bin/
Disallow /blog/tag/
Disallow /blog/wp-admin/
Disallow /blog/wp-includes/
Disallow /blog/trackback/
Disallow /blog/feed/
Disallow /blog/tags/
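
The two `*` groups above both target the same user agent. Under RFC 9309, groups that match the same user agent are combined into one rule set, and a plain path such as `/js` is a prefix match. The sketch below illustrates that reading; it is not the scanner's own logic, and the helper names `combine_groups` and `star_blocked` are hypothetical. A crawler with its own group (such as googlebot here) would normally follow that group instead of the `*` rules.

```python
# Paths copied from the two "*" groups in this report.
STAR_GROUP_1 = ["/mz", "/js", "/xajax_js"]
STAR_GROUP_2 = [
    "/blog/cgi-bin/",
    "/blog/tag/",
    "/blog/wp-admin/",
    "/blog/wp-includes/",
    "/blog/trackback/",
    "/blog/feed/",
    "/blog/tags/",
]


def combine_groups(*groups):
    """Merge several groups for one user agent, dropping duplicate paths."""
    merged = []
    for group in groups:
        for path in group:
            if path not in merged:
                merged.append(path)
    return merged


def star_blocked(path, disallows):
    """Return True if any Disallow path is a prefix of the requested path."""
    return any(path.startswith(prefix) for prefix in disallows)


STAR_RULES = combine_groups(STAR_GROUP_1, STAR_GROUP_2)

if __name__ == "__main__":
    for candidate in ["/js/app.js", "/jsfoo", "/blog/tags/python", "/blog/post"]:
        print(candidate, star_blocked(candidate, STAR_RULES))
```

Because matching is by prefix, `/js` also covers paths such as `/jsfoo`, which may be broader than the author intended.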

googlebot

Rule Path
Disallow /blog/*.js$
Disallow /blog/*.inc$
Disallow /blog/*.css$
Disallow /blog/*.gz$
Disallow /blog/*.wmv$
Disallow /blog/*.cgi$
Disallow /blog/*.xhtml$
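
The googlebot rules use two pattern characters: `*` matches any run of characters and a trailing `$` anchors the pattern to the end of the URL. The following is a minimal sketch of that matching, assuming the RFC 9309 interpretation of both characters; `pattern_to_regex` and `googlebot_blocked` are hypothetical helper names, not part of any library.

```python
import re

# Patterns copied from the googlebot group in this report.
GOOGLEBOT_DISALLOWS = [
    "/blog/*.js$",
    "/blog/*.inc$",
    "/blog/*.css$",
    "/blog/*.gz$",
    "/blog/*.wmv$",
    "/blog/*.cgi$",
    "/blog/*.xhtml$",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" wherever "*" appeared.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    # "\Z" pins the match to the very end of the path when "$" was present.
    return re.compile(body + (r"\Z" if anchored else ""))


def googlebot_blocked(path, patterns=GOOGLEBOT_DISALLOWS):
    """Return True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for candidate in ["/blog/wp-content/app.js", "/blog/app.js?ver=2", "/js/app.js"]:
        print(candidate, googlebot_blocked(candidate))
```

Under this reading, the `$` anchor means a script requested with a query string (for example `/blog/app.js?ver=2`) is not covered by `/blog/*.js$`.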

mediapartners-google*

Rule Path
Disallow
Allow /*
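
In the mediapartners-google group, the `Disallow` line has an empty value (which matches nothing) and `Allow /*` matches every path, so the group permits everything. The sketch below shows one way to evaluate such a group, assuming the RFC 9309 convention that the longest matching pattern wins and that `Allow` wins a tie. The function names and the second rule set in the usage lines are hypothetical, included only to show the precedence.

```python
import re

# Rules copied from the mediapartners-google group in this report.
MEDIAPARTNERS_RULES = [("disallow", ""), ("allow", "/*")]


def rule_matches(pattern, path):
    """Match a robots.txt pattern ('*' wildcard, optional trailing '$')."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    regex = re.compile(body + (r"\Z" if anchored else ""))
    return regex.match(path) is not None


def is_allowed(path, rules):
    """Longest matching pattern decides; Allow wins ties; default is allowed."""
    best_length = -1
    allowed = True
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty value matches no path
        if rule_matches(pattern, path):
            length = len(pattern)
            if length > best_length or (length == best_length and kind == "allow"):
                best_length = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    print(is_allowed("/blog/tag/python", MEDIAPARTNERS_RULES))
    # Hypothetical rule set, not taken from the report, to show precedence.
    example = [("disallow", "/blog/"), ("allow", "/blog/public/")]
    print(is_allowed("/blog/public/a", example))
    print(is_allowed("/blog/private", example))
```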

Other Records

Field Value
sitemap http://blog.fshark.com/sitemap.xml

Comments

  • Index
  • remove the directories
  • Blog do Shark
  • remove the directories
  • remove scripts, CSS, and the like
  • any address that contains ?
  • Disallow: /blog/*?*
  • allow Google AdSense on any URL
  • Sitemap