recruiter.careers24.com
robots.txt

Robots Exclusion Standard data for recruiter.careers24.com

Resource Scan

Scan Details

Site Domain recruiter.careers24.com
Base Domain careers24.com
Scan Status Ok
Last Scan 2024-05-25T23:37:05+00:00
Next Scan 2024-06-24T23:37:05+00:00

Last Scan

Scanned 2024-05-25T23:37:05+00:00
URL https://recruiter.careers24.com/robots.txt
Domain IPs 104.18.218.28, 104.18.219.28, 2606:4700::6812:da1c, 2606:4700::6812:db1c
Response IP 104.18.219.28
Found Yes
Hash a2544f80ba923f8a23679bbe3846331c73e73a1d925c2e243be700511bfb1fac
SimHash 6604d216c132

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

*

Rule Path
Disallow /_callbacks
Disallow /_controls
Disallow /_integrations
Disallow /_rewritten
Disallow /feeds
Disallow /login
Disallow /logout
Disallow /logs
Disallow /microsite
Disallow /officegame
Disallow /RadControls
Disallow /recruiter
Disallow /Uploads
Disallow /Error404.aspx?
Disallow /jobs/adverts/default.aspx
Disallow /candidate/unsubscribealert/default.aspx
Disallow /WebResource.axd
Disallow /jobs/shortlist
Disallow /ScriptResource.axd
Disallow /WebResource.axd
Disallow /candidate/register/gsp/index.html
Disallow /search-ad/index.html
Disallow /jobs/results.aspx
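
The rules for the `*` group above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`, not a reproduction of the original file: it assumes the two `*` groups in this report (the `crawl-delay` record and the `Disallow` list) are merged into a single group, it copies only a subset of the listed paths, and the agent name `ExampleBot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the crawl-delay record and the Disallow rules reported for "*"
# are written here as one group. Only a subset of the listed paths is copied.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 1
Disallow: /login
Disallow: /logout
Disallow: /Uploads
Disallow: /jobs/shortlist
Disallow: /jobs/results.aspx
"""

BASE = "https://recruiter.careers24.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` (a hypothetical crawler name) may fetch `path`."""
    return parser.can_fetch(agent, BASE + path)


parser = build_parser(ROBOTS_TXT)

for path in ["/", "/login", "/jobs/results.aspx?page=2", "/uploads"]:
    print(path, is_allowed(parser, path))

# Delay in seconds that the "*" group requests between fetches.
print("crawl-delay:", parser.crawl_delay("ExampleBot"))
```

Matching in this parser is by case-sensitive path prefix, so `/Uploads` is blocked while `/uploads` is not, and a rule such as `/login` also covers any longer path that begins with `/login`.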

scrapy
simplepie
yandex
moget
ichiro
naverbot
yeti
baiduspider
baiduspider-video
baiduspider-image
sogou spider
youdaobot
twitterbot
scrapy

Rule Path
Disallow /
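
The group above blocks the listed crawlers from every path. As a rough sketch of how a crawler could test its own user-agent string against such a group, the example below rebuilds the group from the names in this report (with the repeated `scrapy` entry listed once) and queries it with `urllib.robotparser`. The helper name `build_blocklist_parser` and the sample agent strings are illustrative assumptions.

```python
from urllib.robotparser import RobotFileParser

# User agents copied from the report; the repeated "scrapy" entry is listed once.
BLOCKED_AGENTS = [
    "scrapy", "simplepie", "yandex", "moget", "ichiro", "naverbot", "yeti",
    "baiduspider", "baiduspider-video", "baiduspider-image", "sogou spider",
    "youdaobot", "twitterbot",
]

URL = "https://recruiter.careers24.com/"


def build_blocklist_parser(agents: list[str]) -> RobotFileParser:
    """Build one robots.txt group: several User-agent lines sharing 'Disallow: /'."""
    lines = [f"User-agent: {name}" for name in agents]
    lines.append("Disallow: /")
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


blocklist = build_blocklist_parser(BLOCKED_AGENTS)

# Sample agent strings are hypothetical; the parser compares the product token
# (the part before "/") case-insensitively against the group's names.
for agent in ["Twitterbot/1.0", "Scrapy/2.11 (+https://scrapy.org)", "Googlebot"]:
    print(agent, blocklist.can_fetch(agent, URL))
```

Because this sketch contains no `*` group, any agent not named in the list falls through to the parser's default of allowing the fetch. In the scanned file such an agent would instead be governed by the `*` rules.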