newsman.ro
robots.txt

Robots Exclusion Standard data for newsman.ro

Resource Scan

Scan Details

Site Domain newsman.ro
Base Domain newsman.ro
Scan Status Ok
Last Scan2025-09-03T20:11:54+00:00
Next Scan 2025-10-03T20:11:54+00:00

Last Scan

Scanned2025-09-03T20:11:54+00:00
URL https://newsman.ro/robots.txt
Redirect https://www.newsman.ro/robots.txt
Redirect Domain www.newsman.ro
Redirect Base newsman.ro
Domain IPs 167.235.216.76
Redirect IPs 104.21.10.158, 172.67.131.140
Response IP 172.67.131.140
Found Yes
Hash 9d28e917eda14e0b467de9dc40771289021ab7171c9248e8f73d8990b04e4378
SimHash 682594208b13

Groups

*

Rule Path
Allow /
Disallow /builder/start*
Disallow /builder/templatedetails/template_id*
Disallow /partner/
Disallow /confirm/register*

Other Records

Field Value
sitemap https://www.newsman.ro/sitemap_index.xml