elgeneracionalpost.com
robots.txt
Robots Exclusion Standard data for elgeneracionalpost.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | elgeneracionalpost.com |
Base Domain | elgeneracionalpost.com |
Scan Status | Ok |
Last Scan | 2024-11-17T04:56:30+00:00 |
Next Scan | 2024-11-18T04:56:30+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-17T04:56:30+00:00 |
URL | https://elgeneracionalpost.com/robots.txt |
Domain IPs | 192.0.78.147, 192.0.78.202 |
Response IP | 192.0.78.147 |
Found | Yes |
Hash | 5ce2a178bc4ca101db8994491907b2a1a91f58d3b6b6c9fcdeb560644d8762eb |
SimHash | b5d048d2c6d7 |
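The Hash value is 64 hexadecimal digits long, which is consistent with SHA-256, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a change check between two scans could be sketched as follows. The function names `content_hash` and `has_changed` are hypothetical and not part of the scanner.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's Hash field is SHA-256 of the raw bytes.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return content_hash(body) != previous_hash


# Illustrative bodies only; these are not the site's actual file contents.
old_body = b"User-agent: *\nDisallow: /author/\n"
new_body = b"User-agent: *\nDisallow: /author/\nDisallow: /tag/\n"

stored = content_hash(old_body)
print(len(stored))                    # digest length in hex digits
print(has_changed(stored, old_body))  # same bytes, same digest
print(has_changed(stored, new_body))  # different bytes, different digest
```

An exact hash only signals that something changed. A similarity hash such as the SimHash field is the kind of value that can indicate how much changed, but its exact construction is not described in this report.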
Groups
User-agent: *
Rule | Path |
---|---|
Allow | /actualidad |
Allow | /noticias/ciencia |
Allow | /noticias/noticias-internacionales-de-ultima-hora |
Allow | /noticias/cultura |
Allow | /noticias/deportes |
Allow | /noticias/politica |
Allow | /noticias/fotoperiodismo |
Allow | /noticias/opinion |
Disallow | /author/ |
Other Records
Field | Value |
---|---|
sitemap | https://elgeneracionalpost.com/news-sitemap.xml |
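To see how a crawler would interpret these records, the group and sitemap can be rebuilt as a robots.txt body and checked with Python's standard `urllib.robotparser`. This is a minimal sketch: the file text below is reconstructed from the tables in this report, so the original wording and line order are assumptions, and the agent name `ExampleBot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report's tables; the real file's exact text
# and rule order are assumptions.
ROBOTS_TXT = """\
User-agent: *
Allow: /actualidad
Allow: /noticias/ciencia
Allow: /noticias/noticias-internacionales-de-ultima-hora
Allow: /noticias/cultura
Allow: /noticias/deportes
Allow: /noticias/politica
Allow: /noticias/fotoperiodismo
Allow: /noticias/opinion
Disallow: /author/
Sitemap: https://elgeneracionalpost.com/news-sitemap.xml
"""

BASE = "https://elgeneracionalpost.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the given agent may fetch BASE + path."""
    return parser.can_fetch(agent, BASE + path)


# Hypothetical paths chosen to exercise an Allow rule, the Disallow
# rule, and a path that no rule mentions.
for path in ("/noticias/deportes/example", "/author/example", "/contacto"):
    print(path, allowed(path))

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Paths that match no rule are fetchable by default, so the only paths this group actually blocks are those under `/author/`. Python's parser applies the first matching rule rather than the longest match, but because none of the listed prefixes overlap, both strategies give the same result here.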
Warnings
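- `https` is not a known field. This usually means a line in the file begins with a bare `https://` URL, so the text before the first colon was read as a field name.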
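A warning like this can arise when a parser splits each line at the first colon and treats the left side as the field name. The sketch below illustrates that behavior under stated assumptions: the `KNOWN_FIELDS` set, the function `unknown_fields`, and the sample file are all hypothetical, and the scanner's real logic and the site's actual offending line are not shown in this report.

```python
# Hypothetical set of recognized field names; the scanner's real list is unknown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(text: str) -> list[str]:
    """Return field names that are not in KNOWN_FIELDS, in file order."""
    unknown = []
    for raw in text.splitlines():
        # Drop comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # Everything before the first colon is taken as the field name.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            unknown.append(field)
    return unknown


# Hypothetical file: the last line is a URL with no field name in front of it.
sample = (
    "User-agent: *\n"
    "Disallow: /author/\n"
    "https://elgeneracionalpost.com/news-sitemap.xml\n"
)

print(unknown_fields(sample))
```

In a case like the sample, prefixing the URL with `Sitemap:` would turn the line into a recognized record.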
Comments