crowdin.com
robots.txt
Robots Exclusion Standard data for crowdin.com
Resource Scan
Scan Details
Site Domain | crowdin.com |
Base Domain | crowdin.com |
Scan Status | Ok |
Last Scan | 2024-05-25T21:38:41+00:00 |
Next Scan | 2024-06-08T21:38:41+00:00 |
Last Scan
Scanned | 2024-05-25T21:38:41+00:00 |
URL | https://crowdin.com/robots.txt |
Domain IPs | 3.220.178.183, 3.224.228.126, 54.243.50.10 |
Response IP | 54.243.50.10 |
Found | Yes |
Hash | b7dbaec9574ab61dbb0c112a04f088c57257ab86a3e03447a542f0bfeeeede97 |
SimHash | e9317a5065d3 |
Groups
*
Rule | Path |
---|---|
Disallow | /app |
Disallow | /join |
Disallow | /login |
Disallow | /translate |
Disallow | /settings |
Disallow | /backend |
Disallow | /download |
Disallow | /case-studies/* |
Disallow | /*/js/lib/magic/* |
Disallow | /blog/post/* |
Disallow | /*? |
Other Records
Field | Value |
---|---|
sitemap | https://crowdin.com/meat/sitemap.xml |
sitemap | https://crowdin.com/blog/sitemap.xml |