thenewsmarket.com
robots.txt

Robots Exclusion Standard data for thenewsmarket.com

Resource Scan

Scan Details

Site Domain thenewsmarket.com
Base Domain thenewsmarket.com
Scan Status Ok
Last Scan2024-06-24T00:29:07+00:00
Next Scan 2024-07-24T00:29:07+00:00

Last Scan

Scanned2024-06-24T00:29:07+00:00
URL https://thenewsmarket.com/robots.txt
Redirect https://www.thenewsmarket.com/robots.txt
Redirect Domain www.thenewsmarket.com
Redirect Base thenewsmarket.com
Domain IPs 52.6.194.240
Redirect IPs 13.33.88.120, 13.33.88.29, 13.33.88.64, 13.33.88.91, 2600:9000:223b:1a00:2:4a9b:ddc0:93a1, 2600:9000:223b:4600:2:4a9b:ddc0:93a1, 2600:9000:223b:9e00:2:4a9b:ddc0:93a1, 2600:9000:223b:a00:2:4a9b:ddc0:93a1, 2600:9000:223b:b600:2:4a9b:ddc0:93a1, 2600:9000:223b:b800:2:4a9b:ddc0:93a1, 2600:9000:223b:cc00:2:4a9b:ddc0:93a1, 2600:9000:223b:e00:2:4a9b:ddc0:93a1
Response IP 13.33.88.120
Found Yes
Hash 006e1192b76e2f72a1069dd20a5f9392ea7c94ed3682dc64a29bf9f8673327dc
SimHash 29145b40cf53

Groups

*

Rule Path
Allow /
Disallow *handler%3D*
Disallow *categoryid%3D*
Disallow *panelid%3D*
Disallow *tags%3D*

Other Records

Field Value
sitemap https://www.thenewsmarket.com/sitemap/sitemap.xml
sitemap https://www.thenewsmarket.com/sitemap/news-sitemap.xml