celsojunior.net
robots.txt

Robots Exclusion Standard data for celsojunior.net

Resource Scan

Scan Details

Site Domain celsojunior.net
Base Domain celsojunior.net
Scan Status Ok
Last Scan2025-11-30T01:38:55+00:00
Next Scan 2025-12-07T01:38:55+00:00

Last Scan

Scanned2025-11-30T01:38:55+00:00
URL https://celsojunior.net/robots.txt
Domain IPs 104.21.37.220, 172.67.213.157, 2606:4700:3035::ac43:d59d, 2606:4700:3036::6815:25dc
Response IP 172.67.213.157
Found Yes
Hash fb528cc3a3961dc18dc614bdbc5a358e4f1fde8a5c73efe19cd7030158c239d7
SimHash 68053b59d3a6

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /blog/wp-admin/
Disallow /blog/wp-includes/
Disallow /blog/wp-content/plugins/
Disallow /blog/wp-content/themes/
Disallow /blog/wp-content/cache/
Disallow /download/
Disallow /i/
Disallow /images/
Disallow /shop/
Disallow /static/
Disallow /*index.html$

googlebot

Rule Path
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /*?*

mediapartners-google*

Rule Path
Disallow
Allow /*

googlebot-image

Rule Path
Allow /blog/wp-content/uploads/

Other Records

Field Value
sitemap http://www.celsojunior.net/blog/sitemap_index.xml

Comments

  • XML-SITEMAP
  • DISALLOW DIRECTORIES
  • DISALLOW SCRIPTS AND CSS
  • DISALLOW URL WITH ?
  • ALLOW ADSENSE
  • ALLOW GOOGLE IMAGE BOT