news29.ru
robots.txt
Robots Exclusion Standard data for news29.ru
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | news29.ru |
Base Domain | news29.ru |
Scan Status | Ok |
Last Scan | 2024-11-11T21:02:32+00:00 |
Next Scan | 2024-11-18T21:02:32+00:00 |
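The two timestamps above imply the rescan interval. A minimal sketch of that arithmetic, using only the values shown in the table (the variable names are illustrative):

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-11-11T21:02:32+00:00")
next_scan = datetime.fromisoformat("2024-11-18T21:02:32+00:00")

# The difference between the two is the rescan interval.
interval = next_scan - last_scan
print(interval)  # 7 days, 0:00:00
```

On this evidence the scan appears to run weekly, although the report does not state the schedule explicitly.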
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-11T21:02:32+00:00 |
URL | https://news29.ru/robots.txt |
Redirect | https://www.news29.ru/robots.txt |
Redirect Domain | www.news29.ru |
Redirect Base | news29.ru |
Domain IPs | 84.201.172.196 |
Redirect IPs | 84.201.172.196 |
Response IP | 84.201.172.196 |
Found | Yes |
Hash | d81f2454c05578c7d30d9a4e58f32a627dba6d16ef1903f19d5443b8005437e5 |
SimHash | 68111f5e4676 |
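The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, though the report does not name the algorithm. Assuming SHA-256 over the raw response body, a content fingerprint of this kind could be computed as below. The sample body is hypothetical and is not the actual file served by news29.ru.

```python
import hashlib

def content_hash(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical body, used only to show the shape of the result.
sample_body = b"User-agent: *\nDisallow: /cgi-bin\n"
digest = content_hash(sample_body)
print(len(digest))  # 64
```

Comparing the digest from one scan with the digest from the next is a cheap way to tell whether the file changed at all.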
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin |
Disallow | /reklama/?* |
Disallow | /reklama/*?* |
Disallow | /reklama/ban* |
Disallow | /reklama?* |
Disallow | /mobile* |
Disallow | /*?* |
Disallow | /index.asp |
Disallow | /index.php |
Disallow | /index.jsp |
Disallow | /index.pl |
Disallow | /index.py |
Disallow | /novosti_za_period/* |
Disallow | /novosti/*print |
Disallow | /*/page/* |
Disallow | /glavnye_novosti_arhangelska/* |
Disallow | */glavnye_novosti_arhangelska/* |
Disallow | /pda/* |
Disallow | /?remembered |
Disallow | /?oldSite |
Disallow | /admin* |
Disallow | /manager* |
Disallow | /admin/* |
Disallow | /manager/* |
Disallow | /user* |
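To see how the rules above apply to a given URL path, here is a minimal sketch of wildcard matching as described in RFC 9309: `*` matches any sequence of characters, a trailing `$` anchors the end, and a rule otherwise matches as a prefix. The helper names and the test paths are illustrative, and individual crawlers may interpret edge cases differently (for example the rule that does not begin with `/`).

```python
import re

# Disallow rules for the "*" group, copied from the table above.
DISALLOW = [
    "/cgi-bin", "/reklama/?*", "/reklama/*?*", "/reklama/ban*", "/reklama?*",
    "/mobile*", "/*?*", "/index.asp", "/index.php", "/index.jsp", "/index.pl",
    "/index.py", "/novosti_za_period/*", "/novosti/*print", "/*/page/*",
    "/glavnye_novosti_arhangelska/*", "*/glavnye_novosti_arhangelska/*",
    "/pda/*", "/?remembered", "/?oldSite", "/admin*", "/manager*",
    "/admin/*", "/manager/*", "/user*",
]

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule into a compiled regular expression."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape everything literally (including "?"), then turn "*" into ".*".
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, rules=DISALLOW) -> bool:
    """True if any rule matches the path from its start (prefix semantics)."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

# Hypothetical paths, chosen only to exercise the rules.
for path in ["/novosti/12345", "/novosti/12345?utm=1",
             "/novosti/12345/print", "/user/login"]:
    print(path, is_disallowed(path))
```

Because the group contains no Allow rules, any single match is enough to block a path. Under prefix semantics a trailing `*` adds nothing, so `/admin*` already covers everything that `/admin/*` does, and `/*?*` blocks every URL that carries a query string.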
Other Records
Field | Value |
---|---|
sitemap | http://www.news29.ru/sitemap.xml |
Warnings
- `host` is not a known field.
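A warning like this typically comes from comparing each field name in the file against a list of recognized fields. `Host` was a Yandex-specific directive and is not part of the Robots Exclusion Protocol. The sketch below shows one way such a check might work. The set of known fields and the sample file are assumptions, not the scanner's actual logic or the site's actual file.

```python
# Assumed set of recognized fields; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

def find_unknown_fields(text: str) -> list[str]:
    """Return unrecognized field names in order of first appearance."""
    unknown = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in unknown:
            unknown.append(field)
    return unknown

# Hypothetical robots.txt content with placeholder hostnames.
sample = """\
User-agent: *
Disallow: /cgi-bin
Host: www.example.com
Sitemap: http://www.example.com/sitemap.xml
"""
print(find_unknown_fields(sample))  # ['host']
```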