utkarsh.com
robots.txt

Robots Exclusion Standard data for utkarsh.com

Resource Scan

Scan Details

Site Domain utkarsh.com
Base Domain utkarsh.com
Scan Status Ok
Last Scan 2025-05-26T12:02:59+00:00
Next Scan 2025-06-25T12:02:59+00:00

Last Scan

Scanned 2025-05-26T12:02:59+00:00
URL https://utkarsh.com/robots.txt
Domain IPs 13.126.47.178, 13.126.65.129, 43.204.0.228
Response IP 13.126.65.129
Found Yes
Hash 1da3bb31d92ece8728d3f4f9d434d7a6f476fe5ad5f21644fd6c48bbd9f121bf
SimHash 4d58c815c7b3

Groups

*

Rule Path
Allow /
Disallow /current-affairs/search/
Disallow /hi/current-affairs/search/
Disallow /exams/search/
Disallow /hi/exams/search/
Disallow *?preview*
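
To illustrate how the rules in this group might combine, here is a minimal sketch that assumes the crawler follows the longest-match convention described in RFC 9309 (the most specific matching pattern wins, with Allow winning ties) and treats `*` as a wildcard. Real crawlers may evaluate the group differently. The helper names `pattern_to_regex` and `is_allowed` and the sample paths are hypothetical and are not part of the scan data.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/current-affairs/search/"),
    ("disallow", "/hi/current-affairs/search/"),
    ("disallow", "/exams/search/"),
    ("disallow", "/hi/exams/search/"),
    ("disallow", "*?preview*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under the assumed longest-match rule."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for directive, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt patterns do.
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


if __name__ == "__main__":
    for sample in ["/exams/", "/exams/search/ssc", "/courses?preview=1"]:
        print(sample, is_allowed(sample))
```

Under these assumptions, `Allow: /` only acts as a fallback: any path that also matches one of the longer Disallow patterns is blocked. Python's standard `urllib.robotparser` is not used here because it applies rules in file order and does not expand `*` inside paths, so it would give different answers for this group.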

Other Records

Field Value
sitemap https://utkarsh.com/sitemap.xml

Comments

  • Disallow: /blog/blog_hn/
  • Disallow: /blog/blog_en/
  • Disallow: /blog/blog_hn/current-affairs-hindi/
  • Disallow: /blog/news-hindi/
  • Disallow: /blog/news-english/
  • links changed urls
  • Disallow: /student-corner/
  • Disallow: /hn/student-corner/
  • Disallow: /job/
  • Disallow: /hn/job/
  • Disallow: /hn/current-affairs/