giornali.it
robots.txt

Robots Exclusion Standard data for giornali.it

Resource Scan

Scan Details

Site Domain giornali.it
Base Domain giornali.it
Scan Status Ok
Last Scan 2024-09-26T10:54:31+00:00
Next Scan 2024-10-03T10:54:31+00:00

Last Scan

Scanned 2024-09-26T10:54:31+00:00
URL https://giornali.it/robots.txt
Domain IPs 51.210.235.192
Response IP 51.210.235.192
Found Yes
Hash 048bd750c6ffa48f600772628c039a142a519489245095f0a93374129b011f6a
SimHash 8d045db31787

Groups

googlebot

Rule Path
Allow *page/2/
Allow *page/3/
Allow *page/4/
Allow *page/5/
Allow *page/6/
Allow *page/7/
Allow *page/8/
Allow *page/9/
Allow *page/10/
Disallow *page/
Disallow *t-date%3D
Disallow *filter%3D
Disallow *type%3D
Disallow /js/addtohomescreen.js
Disallow /ws/share
Disallow /search
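
The googlebot group relies on rule precedence: the Allow rules for pages 2 to 10 only take effect because they are longer than the overlapping Disallow *page/ rule. The sketch below illustrates how such a rule set could be evaluated, assuming the longest-match precedence described in RFC 9309 (most specific pattern wins, Allow wins a tie). The names RULES, pattern_to_regex and is_allowed are hypothetical helpers introduced for illustration, and the matcher is simplified: it does not normalize percent-encoding, so it is not a full reimplementation of any crawler's behavior.

```python
import re

# Rules copied from the googlebot group above, as (kind, pattern) pairs.
RULES = [
    ("allow", "*page/2/"), ("allow", "*page/3/"), ("allow", "*page/4/"),
    ("allow", "*page/5/"), ("allow", "*page/6/"), ("allow", "*page/7/"),
    ("allow", "*page/8/"), ("allow", "*page/9/"), ("allow", "*page/10/"),
    ("disallow", "*page/"),
    ("disallow", "*t-date%3D"),
    ("disallow", "*filter%3D"),
    ("disallow", "*type%3D"),
    ("disallow", "/js/addtohomescreen.js"),
    ("disallow", "/ws/share"),
    ("disallow", "/search"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Assumption: percent-encoded sequences are compared literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under these assumptions, is_allowed("/page/2/") is True because *page/2/ is longer than *page/, while is_allowed("/page/11/") is False because only the Disallow rule matches.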

Other Records

Field Value
sitemap https://giornali.it/sitemap.php