gremistas.net
robots.txt

Robots Exclusion Standard data for gremistas.net

Resource Scan

Scan Details

Site Domain gremistas.net
Base Domain gremistas.net
Scan Status Ok
Last Scan2024-11-14T09:42:21+00:00
Next Scan 2024-11-21T09:42:21+00:00

Last Scan

Scanned2024-11-14T09:42:21+00:00
URL https://gremistas.net/robots.txt
Domain IPs 104.26.12.3, 104.26.13.3, 172.67.72.74, 2606:4700:20::681a:c03, 2606:4700:20::681a:d03, 2606:4700:20::ac43:484a
Response IP 172.67.72.74
Found Yes
Hash 63bfa17c8858f0f03d2659e92fd9cbf620d57ff9ad10ee24a3c874206284eb84
SimHash ec5154c0c403

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-json/
Disallow /?rest_route=
Disallow /cdn-cgi/
Disallow /*/*/*/feed/
Disallow /author/*/page/
Disallow /autor/*/page/
Disallow /reportar-erro/?pst=*
Disallow /search/
Disallow /?s=
Disallow /page/*/?s=

mediapartners-google

Rule Path
Allow /

facebookexternalhit

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.gremistas.net/news-sitemap.xml
sitemap https://www.gremistas.net/sitemap_index.xml

Comments

  • Remove
  • Search URLs
  • AdSense
  • Facebook debugger
  • Sitemaps