cucetexam.in
robots.txt

Robots Exclusion Standard data for cucetexam.in

Resource Scan

Scan Details

Site Domain cucetexam.in
Base Domain cucetexam.in
Scan Status Ok
Last Scan 2025-12-19T10:21:43+00:00
Next Scan 2026-01-18T10:21:43+00:00

Last Scan

Scanned 2025-12-19T10:21:43+00:00
URL https://cucetexam.in/robots.txt
Redirect https://www.cucetexam.in/robots.txt
Redirect Domain www.cucetexam.in
Redirect Base cucetexam.in
Domain IPs 2a02:4780:11:1774:0:8e1:b827:7, 82.112.229.206
Redirect IPs 2a02:4780:11:1774:0:8e1:b827:7, 82.112.229.206
Response IP 82.112.229.206
Found Yes
Hash 8978d30cbf7a910e949d94a916f5ad36ee32d195d0ebc4c8db532921e5551d80
SimHash 6026e902c198

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /*?s=
Disallow */trackback/
Disallow */comments/feed/
Disallow */embed/
Disallow */cgi-bin/
Disallow */search/
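
To illustrate how the Disallow rules in the "*" group could be applied to a URL path, here is a minimal Python sketch. It assumes the common wildcard semantics in which "*" matches any run of characters and a trailing "$" anchors the end of the path. The names DISALLOW, pattern_to_regex, and is_disallowed are hypothetical helpers, not part of the scan or of any crawler. Because the group lists no Allow rules, the sketch skips longest-match precedence and checks only whether any Disallow pattern matches.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/wp-admin/",
    "/*?s=",
    "*/trackback/",
    "*/comments/feed/",
    "*/embed/",
    "*/cgi-bin/",
    "*/search/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)
```

Under these assumptions, is_disallowed("/wp-admin/options.php") and is_disallowed("/?s=cucet") return True, while is_disallowed("/about/") returns False. Real crawlers may interpret patterns that begin with "*" differently, so treat this as an approximation.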

Other Records

Field Value
sitemap https://www.cucetexam.in/sitemap_index.xml
sitemap https://www.cucetexam.in/post-sitemap.xml
sitemap https://www.cucetexam.in/page-sitemap.xml
sitemap https://www.cucetexam.in/category-sitemap.xml
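
The sitemap records above can be collected from raw robots.txt text with a few lines of Python. This is a sketch only: extract_sitemaps is a hypothetical helper, and SAMPLE is a hand-written fragment modelled on the records in this report, not the actual file served by the site.

```python
def extract_sitemaps(robots_txt: str) -> list[str]:
    """Collect the values of all 'Sitemap:' records, in file order."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop comments, then split the record into field and value.
        line = line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        # Field names are matched case-insensitively.
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Hypothetical fragment for illustration; not the real file contents.
SAMPLE = """\
# Sitemap location
Sitemap: https://www.cucetexam.in/sitemap_index.xml
Sitemap: https://www.cucetexam.in/post-sitemap.xml
"""
```

Calling extract_sitemaps(SAMPLE) yields the two sitemap URLs in the order they appear. Sitemap records are independent of user-agent groups, so the helper scans the whole file rather than a single group.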

Comments

  • Block unwanted URL parameters
  • Block Embed, PDF, and Reprint Sections
  • Sitemap location