dice.com
robots.txt

Robots Exclusion Standard data for dice.com

Resource Scan

Scan Details

Site Domain dice.com
Base Domain dice.com
Scan Status Ok
Last Scan 2024-11-16T06:37:44+00:00
Next Scan 2024-11-23T06:37:44+00:00

Last Scan

Scanned 2024-11-16T06:37:44+00:00
URL https://dice.com/robots.txt
Redirect https://www.dice.com/robots.txt
Redirect Domain www.dice.com
Redirect Base dice.com
Domain IPs 23.21.32.15, 3.232.136.192, 54.86.227.60
Redirect IPs 23.21.32.15, 3.232.136.192, 54.86.227.60
Response IP 23.21.32.15
Found Yes
Hash 6e5199824b409a0e7ca7dd7f87829b3ce411a873cab73a5eb470164136d32909
SimHash 028888014037

Groups

*

Rule Path
Disallow /admin
Disallow /jobman
Disallow /reports
Disallow /talentmatch
Disallow /profman
Disallow /regman
Disallow /ows
Disallow /config
Disallow /m2
Disallow /jobsearch/
Disallow /job
Disallow /feed/
Disallow /resumepost
Disallow /profile/
Disallow /rss/
Disallow /salary-calculator?title*
Disallow /jobs?q*
Disallow /jobs/?q*
Disallow /canyouhackit/
Disallow /career-paths?title
Disallow /salary-calculator-for-tech-hiring
Disallow /daf/servlet/
Disallow /jobs/dc-*
Disallow /jobs/djt-*
Disallow /products/?
Disallow *?CMPID*
Disallow */jobs.html
Disallow /career-advice/topic/*
Disallow /recruiting-advice/topic/*
Disallow /career-advice/search*
Disallow /recruiting-advice/search*
Allow /jobs
Allow /register
Allow /mobile
Allow /job-detail
Allow /support
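
The "*" group mixes overlapping Disallow and Allow paths (for example Disallow /job and Allow /jobs), so the outcome for a given URL depends on how a crawler resolves conflicts. The sketch below assumes the longest-match rule from RFC 9309, where the most specific matching pattern wins and Allow wins a tie. It uses only a subset of the rules listed above; the names RULES, pattern_to_regex and is_allowed are illustrative and not part of any scanner or crawler. Real crawlers may behave differently.

```python
import re

# A subset of the rules listed for the "*" group above.
RULES = [
    ("disallow", "/job"),
    ("disallow", "/jobs?q*"),
    ("disallow", "/jobs/dc-*"),
    ("disallow", "*?CMPID*"),
    ("allow", "/jobs"),
    ("allow", "/job-detail"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if path may be fetched under longest-match semantics."""
    best_len, allowed = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/jobs", "/job/123", "/job-detail/abc", "/jobs?q=python"]:
        print(p, is_allowed(p))
```

Under these assumptions /jobs and /job-detail/abc are allowed because their Allow patterns are longer than /job, while /jobs?q=python stays blocked by the longer /jobs?q* pattern. A first-match parser would read the same rules differently, since Disallow /job appears before Allow /jobs.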

googlebot-news

Rule Path
Disallow /
Allow /career-advice
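
A crawler that matches a named group uses that group alone and does not inherit the "*" rules. The sketch below illustrates that selection step for the two groups in this report, assuming RFC 9309 behaviour (case-insensitive product-token match, fallback to "*", longest prefix wins). The GROUPS table holds only a few of the listed rules, wildcards are left out for brevity, and all names are hypothetical.

```python
# A few rules from each group in the report; plain prefixes only.
GROUPS = {
    "*": [("disallow", "/admin"), ("allow", "/jobs")],
    "googlebot-news": [("disallow", "/"), ("allow", "/career-advice")],
}


def select_group(user_agent, groups=GROUPS):
    """Pick the rules for the named group matching user_agent, else "*"."""
    token = user_agent.lower()
    for name, rules in groups.items():
        if name != "*" and name in token:
            return rules
    return groups.get("*", [])


def is_allowed(user_agent, path):
    """Longest matching prefix decides; Allow wins a tie; default allow."""
    best_len, allowed = -1, True
    for kind, prefix in select_group(user_agent):
        if path.startswith(prefix):
            length = len(prefix)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed


if __name__ == "__main__":
    print(is_allowed("Googlebot-News", "/career-advice/some-article"))
    print(is_allowed("Googlebot-News", "/jobs"))
    print(is_allowed("ExampleBot", "/jobs"))
```

With these assumptions the googlebot-news group permits only paths under /career-advice, which is consistent with the Google News sitemap listed under Other Records.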

Other Records

Field Value
sitemap https://www.dice.com/sitemap-index.xml
sitemap https://www.dice.com/career-advice/google-news-sitemap.xml

Comments

  • explicit sitemap path