worldwebnewspapers.com
robots.txt

Robots Exclusion Standard data for worldwebnewspapers.com

Resource Scan

Scan Details

Site Domain worldwebnewspapers.com
Base Domain worldwebnewspapers.com
Scan Status Ok
Last Scan 2024-09-01T04:13:42+00:00
Next Scan 2024-10-01T04:13:42+00:00

Last Scan

Scanned 2024-09-01T04:13:42+00:00
URL https://worldwebnewspapers.com/robots.txt
Domain IPs 104.21.6.165, 172.67.135.8, 2606:4700:3033::ac43:8708, 2606:4700:3034::6815:6a5
Response IP 172.67.135.8
Found Yes
Hash 0b6259926ddc876e9df831c66362540c1471fec9763a6d158dc8a1932e46bc8c
SimHash 394ddd04c591

Groups

User-agent: *

Rule Path
Disallow /*.gif$
Disallow /*.Gif$
Disallow /*.GIF$
Disallow /*.jpg$
Disallow /*.Jpg$
Disallow /*.JPG$
Disallow /*.jpeg$
Disallow /*.Jpeg$
Disallow /*.JPEG$
Disallow /*.pdf$
Disallow /*.Pdf$
Disallow /*.PDF$
Disallow /*.zip$
Disallow /*.Zip$
Disallow /*.ZIP$
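
The paths listed above use two pattern extensions that major crawlers support: `*` matches any run of characters, and a trailing `$` anchors the pattern to the end of the URL. Matching is case-sensitive, which is presumably why each extension appears in three spellings. The Python sketch below illustrates how such patterns could be evaluated. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and the sketch assumes a file with only Disallow rules, so it ignores Allow lines and longest-match precedence.

```python
import re

# Disallow paths copied from the "*" group in this report.
EXTENSIONS = ("gif", "jpg", "jpeg", "pdf", "zip")
DISALLOW_PATTERNS = [
    f"/*.{variant}$"
    for ext in EXTENSIONS
    for variant in (ext, ext.capitalize(), ext.upper())
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the match at the end.
    Everything else is matched literally and case-sensitively.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


COMPILED = [pattern_to_regex(p) for p in DISALLOW_PATTERNS]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    return any(rx.match(path) for rx in COMPILED)


if __name__ == "__main__":
    for sample in (
        "/images/logo.gif",
        "/images/logo.GIF",
        "/images/logo.gIf",
        "/docs/report.pdf?download=1",
        "/index.html",
    ):
        print(sample, is_disallowed(sample))
```

Under these assumptions, a mixed-case spelling such as `.gIf` is not covered by any of the fifteen rules, and a URL that ends in a query string is not matched because of the `$` anchor.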

Other Records

Field Value
sitemap http://www.worldwebnewspapers.com/sitemap.gz