soma.org.br
robots.txt
Robots Exclusion Standard data for soma.org.br
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | soma.org.br |
Base Domain | soma.org.br |
Scan Status | Ok |
Last Scan | 2024-09-28T01:06:41+00:00 |
Next Scan | 2024-10-05T01:06:41+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-28T01:06:41+00:00 |
URL | https://soma.org.br/robots.txt |
Redirect | https://www.soma.org.br/robots.txt |
Redirect Domain | www.soma.org.br |
Redirect Base | soma.org.br |
Domain IPs | 3.135.145.45 |
Redirect IPs | 3.135.145.45 |
Response IP | 3.135.145.45 |
Found | Yes |
Hash | eac935995fe3557cadc24b4c87f2ef0df1e4068df58fe954134315c5c416b263 |
SimHash | 0b7d444d5b30 |
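The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so that is an assumption. Under that assumption, a minimal sketch for fingerprinting a fetched robots.txt body to detect changes between scans could look like this (`content_hash` is a hypothetical helper name, not part of the scanner):

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw robots.txt body.

    Assumption: the scanner hashes the exact response bytes. The report
    does not document this, so a digest computed here may not match the
    Hash field if the scanner normalizes the content first.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest against the digest of a newly fetched body."""
    return content_hash(body) != previous_digest
```

Comparing digests only detects that the file changed. A similarity hash such as the SimHash field is the kind of value that indicates how much it changed, but its exact construction is not described here.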
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /administrator/ |
Disallow | /arquivos/ |
Disallow | /author/ |
Disallow | /bin/ |
Disallow | /cache/ |
Disallow | /cli/ |
Disallow | /components/ |
Disallow | /component/ |
Disallow | /includes/ |
Disallow | /installation/ |
Disallow | /item/ |
Disallow | /language/ |
Disallow | /layouts/ |
Disallow | /libraries/ |
Disallow | /logs/ |
Disallow | /media/ |
Disallow | /modules/ |
Disallow | /plugins/ |
Disallow | /tag/ |
Disallow | /templates/ |
Disallow | /tmp/ |
Disallow | */index.php* |
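Most rules in this group are plain path prefixes, while `*/index.php*` uses the `*` wildcard. The following sketch shows one way to test a URL path against such rules, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. Python's standard `urllib.robotparser` does not interpret wildcards, so the matcher is written by hand. The names `rule_to_regex` and `is_disallowed` are illustrative, and only a subset of the rules is listed.

```python
import re

# A subset of the Disallow paths reported for the "*" group.
DISALLOW = ["/administrator/", "/cache/", "/tmp/", "*/index.php*"]


def rule_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into an anchored regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal segments and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """Return True if any Disallow rule matches the start of the path.

    This group contains only Disallow rules, so any match blocks the path.
    A file that also had Allow rules would need longest-match precedence.
    """
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

With these rules, `is_disallowed("/administrator/login")` and `is_disallowed("/noticias/index.php?id=1")` both return `True`, while `is_disallowed("/noticias/")` returns `False`.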
User-agent: aimysitemapcrawler
Rule | Path |
---|---|
Disallow | /images/sampledata/ |
Disallow | *?tmpl=component&print=1* |
Disallow | */author/* |
Disallow | *%7B%%3Dfile.url%%7D* |
Disallow | *%7B%%3Dfile.thumbnailUrl%%7D* |
Disallow | *profileLink* |
Disallow | */avatar* |
Disallow | *?Itemid=* |
Disallow | *format%3Dfeed* |
Disallow | */item/* |
Disallow | */internasMaterias/* |
Disallow | */index.php* |
Disallow | */arquivos/* |
Disallow | */2* |
Disallow | */3* |
Disallow | */4* |
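The group tables above can be turned back into robots.txt syntax. The sketch below renders a mapping of user agents to Disallow paths, plus an optional sitemap URL, as text. It is an approximation: the original file's ordering, blank lines, and comments are not recoverable from this report, so the output is not expected to reproduce the reported Hash. `render_robots` is a hypothetical helper name.

```python
from typing import Dict, List, Optional


def render_robots(groups: Dict[str, List[str]],
                  sitemap: Optional[str] = None) -> str:
    """Render user-agent groups and an optional sitemap as robots.txt text."""
    blocks = []
    for agent, paths in groups.items():
        lines = [f"User-agent: {agent}"]
        lines += [f"Disallow: {path}" for path in paths]
        blocks.append("\n".join(lines))
    if sitemap:
        blocks.append(f"Sitemap: {sitemap}")
    # Separate groups with a blank line and end the file with a newline.
    return "\n\n".join(blocks) + "\n"


# Example with a few of the rules from the two groups in this report.
example = render_robots(
    {
        "*": ["/administrator/", "/tmp/", "*/index.php*"],
        "aimysitemapcrawler": ["/images/sampledata/", "*?Itemid=*"],
    },
    sitemap="https://www.soma.org.br/sitemap.xml",
)
```

Printing `example` gives two `User-agent` blocks followed by the `Sitemap` line, which is a convenient form for reviewing or diffing the rules by eye.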
Other Records
Field | Value |
---|---|
sitemap | https://www.soma.org.br/sitemap.xml |