soma.org.br
robots.txt

Robots Exclusion Standard data for soma.org.br

Resource Scan

Scan Details

Site Domain soma.org.br
Base Domain soma.org.br
Scan Status Ok
Last Scan2024-09-28T01:06:41+00:00
Next Scan 2024-10-05T01:06:41+00:00

Last Scan

Scanned2024-09-28T01:06:41+00:00
URL https://soma.org.br/robots.txt
Redirect https://www.soma.org.br/robots.txt
Redirect Domain www.soma.org.br
Redirect Base soma.org.br
Domain IPs 3.135.145.45
Redirect IPs 3.135.145.45
Response IP 3.135.145.45
Found Yes
Hash eac935995fe3557cadc24b4c87f2ef0df1e4068df58fe954134315c5c416b263
SimHash 0b7d444d5b30

Groups

*

Rule Path
Disallow /administrator/
Disallow /arquivos/
Disallow /author/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /component/
Disallow /includes/
Disallow /installation/
Disallow /item/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /tag/
Disallow /templates/
Disallow /tmp/
Disallow */index.php*

aimysitemapcrawler

Rule Path
Disallow /images/sampledata/
Disallow *?tmpl=component&print=1*
Disallow */author/*
Disallow *%7B%%3Dfile.url%%7D*
Disallow *%7B%%3Dfile.thumbnailUrl%%7D*
Disallow *profileLink*
Disallow */avatar*
Disallow *?Itemid=*
Disallow *format%3Dfeed*
Disallow */item/*
Disallow */internasMaterias/*
Disallow */index.php*
Disallow */arquivos/*
Disallow */2*
Disallow */3*
Disallow */4*

Other Records

Field Value
sitemap https://www.soma.org.br/sitemap.xml

Comments

  • Default Aimy Sitemap robots.txt for Joomla!