sicnoticias.pt
robots.txt
Robots Exclusion Standard data for sicnoticias.pt
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | sicnoticias.pt |
Base Domain | sicnoticias.pt |
Scan Status | Ok |
Last Scan | 2024-11-05T13:00:46+00:00 |
Next Scan | 2024-11-12T13:00:46+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-05T13:00:46+00:00 |
URL | https://sicnoticias.pt/robots.txt |
Domain IPs | 3.160.212.103, 3.160.212.58, 3.160.212.8, 3.160.212.99 |
Response IP | 18.165.122.83 |
Found | Yes |
Hash | f626a0f1dc4598a8b9b9311c9f78a0eb740c8703c489c36e6b60c70830695524 |
SimHash | e8104a5527b3 |
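The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. Under the assumption that it is SHA-256 over the raw response body, a fetched copy could be compared against the report roughly as follows. The helper names `sha256_hex` and `matches_report` are hypothetical and used only for illustration.

```python
import hashlib

# Hash value copied from the scan report. Assumption: SHA-256 of the raw body.
REPORTED_HASH = "f626a0f1dc4598a8b9b9311c9f78a0eb740c8703c489c36e6b60c70830695524"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if the body hashes to the reported value (hypothetical check)."""
    return sha256_hex(body) == reported.lower()
```

A mismatch would not prove the file changed: the scanner may normalize line endings or hash something other than the raw body.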
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /api |
Disallow | /sites |
Disallow | /sandbox |
Disallow | /sandbox2013 |
Disallow | /pesquisa |
Disallow | /hotfolder-video-exports |
Allow | / |
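To see how a crawler might interpret this group, the sketch below rebuilds the rules as robots.txt text and checks a few paths with Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from the table above, not copied from the live file, and the sample paths and the `is_allowed` helper are hypothetical. Note that `urllib.robotparser` applies the first matching rule in file order, whereas RFC 9309 specifies longest-match precedence; for this rule set both approaches should agree, because `Allow: /` is both the last and the shortest rule.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan table; the live file may differ in
# ordering, comments, or whitespace.
ROBOTS_TXT = """\
User-agent: *
Disallow: /api
Disallow: /sites
Disallow: /sandbox
Disallow: /sandbox2013
Disallow: /pesquisa
Disallow: /hotfolder-video-exports
Allow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "*") -> bool:
    """Check whether the given agent may fetch a path on sicnoticias.pt."""
    return parser.can_fetch(agent, "https://sicnoticias.pt" + path)


# Hypothetical sample paths, chosen only to exercise the rules.
for sample in ["/", "/pais/artigo", "/api/v1/items", "/pesquisa?q=x", "/sandbox2013/test"]:
    print(sample, "allowed" if is_allowed(sample) else "disallowed")
```

Rules are prefix matches, so `Disallow: /sandbox` already covers `/sandbox2013`; the separate `/sandbox2013` entry is redundant but harmless.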
Other Records
Field | Value |
---|---|
sitemap | https://sicnoticias.pt/sitemap/news.xml |
sitemap | https://sicnoticias.pt/sitemap/index.xml |
sitemap | https://sicnoticias.pt/sitemap/videos.xml |
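The sitemap records are independent of any user-agent group, so a client can read them without evaluating the rules. A minimal sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available) and using record text reconstructed from the table above:

```python
from urllib.robotparser import RobotFileParser

# Sitemap records reconstructed from the "Other Records" table.
RECORDS = """\
Sitemap: https://sicnoticias.pt/sitemap/news.xml
Sitemap: https://sicnoticias.pt/sitemap/index.xml
Sitemap: https://sicnoticias.pt/sitemap/videos.xml
"""

parser = RobotFileParser()
parser.parse(RECORDS.splitlines())

# site_maps() returns None when no Sitemap lines are present,
# so fall back to an empty list.
sitemaps = parser.site_maps() or []

for url in sitemaps:
    print(url)
```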