cz.trud.com
robots.txt
Robots Exclusion Standard data for cz.trud.com
Resource Scan
Scan Details
Site Domain | cz.trud.com |
Base Domain | trud.com |
Scan Status | Ok |
Last Scan | 2024-09-16T16:22:05+00:00 |
Next Scan | 2024-10-16T16:22:05+00:00 |
Last Scan
Scanned | 2024-09-16T16:22:05+00:00 |
URL | https://cz.trud.com/robots.txt |
Domain IPs | 104.21.53.21, 172.67.207.200, 2606:4700:3036::ac43:cfc8, 2606:4700:3037::6815:3515 |
Response IP | 104.21.53.21 |
Found | Yes |
Hash | 0638e451e169b362de038b627e43e7f2c2512f123207d95d48fcea49d5713be9 |
SimHash | 3296ee02ca33 |
Groups
*
Rule | Path |
---|---|
Allow | /company.html?page= |
Disallow | *? |
Disallow | */search/ |
Disallow | /ads/show/ |
Disallow | /jobs/show |
Disallow | /cvs/show |
Disallow | /crm/* |
Disallow | /crm2/* |
Disallow | /resume/ |
Disallow | /resume.html |
Disallow | /js/utils/* |
Disallow | /office/ |
Disallow | /site/redirect/url |
Disallow | */vacancies.html |
Disallow | */leave-feedback.html |
Disallow | */salary.html |
Disallow | */photos.html |
Disallow | */pros-and-cons.html |
Disallow | */about.html |
Disallow | /viewed/ |
Other Records
Field | Value |
---|---|
sitemap | https://cz.trud.com/sitemap/cz.trud.com-sitemap.xml.gz |
Warnings
- `host` is not a known field.