cincainews.com
robots.txt

Robots Exclusion Standard data for cincainews.com

Resource Scan

Scan Details

Site Domain cincainews.com
Base Domain cincainews.com
Scan Status Ok
Last Scan 2026-02-09T10:33:13+00:00
Next Scan 2026-02-16T10:33:13+00:00
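
The two timestamps above are exactly seven days apart. A minimal sketch of that arithmetic follows, assuming the scanner uses a fixed weekly interval. The interval is inferred only from these two values, and `RESCAN_INTERVAL` and `next_scan` are hypothetical names.

```python
from datetime import datetime, timedelta

# Assumption: a fixed 7-day rescan interval, inferred from the
# Last Scan / Next Scan values in this report.
RESCAN_INTERVAL = timedelta(days=7)


def next_scan(last_scan_iso: str) -> str:
    """Return the next scan time as an ISO 8601 string."""
    last = datetime.fromisoformat(last_scan_iso)
    return (last + RESCAN_INTERVAL).isoformat()


print(next_scan("2026-02-09T10:33:13+00:00"))
```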

Last Scan

Scanned 2026-02-09T10:33:13+00:00
URL https://cincainews.com/robots.txt
Redirect https://www.cincainews.com/robots.txt
Redirect Domain www.cincainews.com
Redirect Base cincainews.com
Domain IPs 104.21.78.70, 172.67.217.218, 2606:4700:3030::ac43:d9da, 2606:4700:3032::6815:4e46
Redirect IPs 104.21.78.70, 172.67.217.218, 2606:4700:3030::ac43:d9da, 2606:4700:3032::6815:4e46
Response IP 172.67.217.218
Found Yes
Hash 2a665090a3265e6c2cad2da8c3a9cfd8aa3847330d1582f960bcca2d494b5b9e
SimHash b12df8c76abe
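
The Hash value is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so the sketch below treats SHA-256 as an assumption. It shows how a later fetch could be compared against a recorded hash to detect changes. `content_hash` and `unchanged` are hypothetical helper names, and the SimHash value is not reproduced here because its algorithm is not specified.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Hex digest of the raw robots.txt bytes (assumed here to be SHA-256)."""
    return hashlib.sha256(body).hexdigest()


def unchanged(body: bytes, recorded_hash: str) -> bool:
    """True if a freshly fetched body matches a previously recorded hash."""
    return content_hash(body) == recorded_hash.lower()


# Usage with placeholder bytes, not the site's actual file:
sample = b"User-agent: *\nDisallow: /test/\n"
print(unchanged(sample, content_hash(sample)))
```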

Groups

*

Rule Path
Disallow /ajax/*
Disallow /print*
Disallow /getRelatedArticles*
Disallow /getMostReadArticles*
Disallow /article_count/*
Disallow /get-menu-header*
Disallow /search*
Disallow /morearticles/*
Disallow /article.php*
Disallow /login-mgt
Disallow /*.php
Disallow /archive/*
Disallow /rss
Disallow /rssFeed/*
Disallow /widget/*
Disallow */page/*
Disallow /test/
Disallow /test*
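
To illustrate how the rules above apply to concrete paths, here is a minimal sketch of wildcard matching in the style of RFC 9309: each rule is a prefix match, `*` matches any run of characters, and a trailing `$` anchors the end of the path. It handles only Disallow rules, since this group lists no Allow rules. `rule_to_regex` and `is_disallowed` are hypothetical names, and the example paths are invented for illustration.

```python
import re

# Disallow rules for the "*" group, copied from the listing above.
DISALLOW = [
    "/ajax/*", "/print*", "/getRelatedArticles*", "/getMostReadArticles*",
    "/article_count/*", "/get-menu-header*", "/search*", "/morearticles/*",
    "/article.php*", "/login-mgt", "/*.php", "/archive/*", "/rss",
    "/rssFeed/*", "/widget/*", "*/page/*", "/test/", "/test*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into an anchored-prefix regex."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal parts and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """True if any Disallow rule matches the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


# Hypothetical paths, not taken from the site:
for path in ["/search?q=x", "/news/page/2", "/index.php",
             "/testimonials", "/news/some-article"]:
    print(path, is_disallowed(path))
```

Under this reading, a trailing `*` adds nothing to a prefix rule, `/test/` is already covered by `/test*`, and `/test*` also blocks unrelated paths such as `/testimonials`.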

Other Records

Field Value
sitemap https://www.cincainews.com/sitemap.xml
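
The group and the sitemap record above can be rendered back into robots.txt syntax. This is a rough sketch only: the original file's comments, blank lines, and field capitalization are not shown in this report, so the output would not be expected to reproduce the recorded Hash. `render_robots` is a hypothetical helper.

```python
def render_robots(user_agent: str, disallow: list, sitemaps: list) -> str:
    """Render one user-agent group plus sitemap records as robots.txt text."""
    lines = [f"User-agent: {user_agent}"]
    lines += [f"Disallow: {path}" for path in disallow]
    lines.append("")  # blank line between the group and other records
    lines += [f"Sitemap: {url}" for url in sitemaps]
    return "\n".join(lines) + "\n"


# Values copied from the Groups and Other Records sections of this report.
rules = [
    "/ajax/*", "/print*", "/getRelatedArticles*", "/getMostReadArticles*",
    "/article_count/*", "/get-menu-header*", "/search*", "/morearticles/*",
    "/article.php*", "/login-mgt", "/*.php", "/archive/*", "/rss",
    "/rssFeed/*", "/widget/*", "*/page/*", "/test/", "/test*",
]
text = render_robots("*", rules, ["https://www.cincainews.com/sitemap.xml"])
print(text)
```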