trustorg.com
robots.txt
Robots Exclusion Standard data for trustorg.com
Resource Scan
Scan Details
Site Domain | trustorg.com |
Base Domain | trustorg.com |
Scan Status | Ok |
Last Scan | 2024-09-29T08:26:35+00:00 |
Next Scan | 2024-10-06T08:26:35+00:00 |
Last Scan
Scanned | 2024-09-29T08:26:35+00:00 |
URL | https://trustorg.com/robots.txt |
Domain IPs | 104.21.39.139, 172.67.146.25, 2606:4700:3033::6815:278b, 2606:4700:3035::ac43:9219 |
Response IP | 172.67.146.25 |
Found | Yes |
Hash | d808324a3862e400b5a850e4673537ed4cd56e9f571c9dc2c9db8af8be0487ac |
SimHash | ae443c14e772 |
Groups
*
Rule | Path |
---|---|
Disallow | /index.php?%2F |
Disallow | /activ/ |
Disallow | /site_trust/ |
Disallow | /js/ |
Disallow | /user/ |
Disallow | /buffer/ |
Disallow | /adminka/ |
Other Records
Field | Value |
---|---|
sitemap | https://trustorg.com/sitemaps/articles_1.xml |
sitemap | https://trustorg.com/sitemaps/phones_1.xml |
sitemap | https://trustorg.com/sitemaps/sitemap_index.xml |
Warnings
- `clean-param` is not a known field.