jornada.com.bo
robots.txt
Robots Exclusion Standard data for jornada.com.bo
Resource Scan
Scan Details
Site Domain | jornada.com.bo |
Base Domain | jornada.com.bo |
Scan Status | Ok |
Last Scan | 2024-09-24T21:44:26+00:00 |
Next Scan | 2024-10-01T21:44:26+00:00 |
Last Scan
Scanned | 2024-09-24T21:44:26+00:00 |
URL | https://jornada.com.bo/robots.txt |
Domain IPs | 104.21.59.181, 172.67.182.63, 2606:4700:3031::ac43:b63f, 2606:4700:3033::6815:3bb5 |
Response IP | 104.21.59.181 |
Found | Yes |
Hash | fc2831de0f379a632ce767aff0dbf517d81df7087e9046da493580c5666e2f51 |
SimHash | ee814c502c13 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /xmlrpc.php |
Disallow | /wp-content/plugins/ |
Disallow | /wp-content/cache/ |
Disallow | /wp-content/themes/ |
Disallow | /trackback/ |
Disallow | /feed/ |
Disallow | /comments/ |
Disallow | /category/ |
Disallow | /*? |
Allow | /wp-content/uploads/ |
Allow | /ads/preferences/ |
Allow | /gpt/ |
Allow | /pagead/show_ads.js |
Allow | /pagead/js/adsbygoogle.js |
Allow | /pagead/js/*/show_ads_impl.js |
Allow | /static/glade.js |
Allow | /static/glade/ |
Other Records
Field | Value |
---|---|
sitemap | https://jornada.com.bo/sitemap_index.xml |
sitemap | https://jornada.com.bo/news-sitemap.xml |