trudbox.kz
robots.txt

Robots Exclusion Standard data for trudbox.kz

Resource Scan

Scan Details

Site Domain trudbox.kz
Base Domain trudbox.kz
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-03-12T05:19:22+00:00
Next Scan 2024-06-10T05:19:22+00:00

Last Successful Scan

Scanned 2023-05-14T21:10:12+00:00
URL https://trudbox.kz/robots.txt
Redirect http://trudbox.kz/robots.txt
Domain IPs 104.21.21.100, 172.67.197.238, 2606:4700:3035::6815:1564, 2606:4700:3036::ac43:c5ee
Response IP 172.67.197.238
Found Yes
Hash 6d56c16648e64df9e2aa74552eb6506bd1f43ab8e28d1b3bef96a9bd1e4ae097
SimHash 5510dd46c3b4

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow *?
Disallow /admin/
Disallow /showjob
Disallow /forget_password
Disallow /register.jobseeker
Disallow */similar/*
Disallow /user/
Disallow /employer/
Disallow /resume/
Allow /resume/create
Allow *?v
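
The `*` group mixes overlapping Allow and Disallow patterns, so the outcome for a given URL depends on rule precedence. The following is a minimal sketch of one common interpretation, assuming the longest-match rule from RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and this is not necessarily how the scanner or any particular crawler evaluates the file.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "*?"),
    ("disallow", "/admin/"),
    ("disallow", "/showjob"),
    ("disallow", "/forget_password"),
    ("disallow", "/register.jobseeker"),
    ("disallow", "*/similar/*"),
    ("disallow", "/user/"),
    ("disallow", "/employer/"),
    ("disallow", "/resume/"),
    ("allow", "/resume/create"),
    ("allow", "*?v"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any character sequence and a trailing '$'
    anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be fetched.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under these assumptions, `/resume/create` is allowed because `/resume/create` is longer than `/resume/`, and a URL such as `/style.css?v=3` is allowed because `*?v` is longer than `*?`, while other query-string URLs remain blocked.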

Other Records

Field Value
sitemap http://trudbox.kz/sitemap-main.xml

Warnings

  • `host` is not a known field.