thenewsmarket.com
robots.txt

Robots Exclusion Standard data for thenewsmarket.com

Resource Scan

Scan Details

Site Domain thenewsmarket.com
Base Domain thenewsmarket.com
Scan Status Ok
Last Scan 2024-10-22T00:47:35+00:00
Next Scan 2024-11-21T00:47:35+00:00

Last Scan

Scanned 2024-10-22T00:47:35+00:00
URL https://thenewsmarket.com/robots.txt
Redirect https://www.thenewsmarket.com/robots.txt
Redirect Domain www.thenewsmarket.com
Redirect Base thenewsmarket.com
Domain IPs 52.6.194.240
Redirect IPs 13.33.88.120, 13.33.88.29, 13.33.88.64, 13.33.88.91, 2600:9000:223b:2800:2:4a9b:ddc0:93a1, 2600:9000:223b:6e00:2:4a9b:ddc0:93a1, 2600:9000:223b:7200:2:4a9b:ddc0:93a1, 2600:9000:223b:7600:2:4a9b:ddc0:93a1, 2600:9000:223b:9800:2:4a9b:ddc0:93a1, 2600:9000:223b:bc00:2:4a9b:ddc0:93a1, 2600:9000:223b:d600:2:4a9b:ddc0:93a1, 2600:9000:223b:f800:2:4a9b:ddc0:93a1
Response IP 13.33.88.91
Found Yes
Hash 006e1192b76e2f72a1069dd20a5f9392ea7c94ed3682dc64a29bf9f8673327dc
SimHash 29145b40cf53

Groups

*

Rule Path
Allow /
Disallow *handler%3D*
Disallow *categoryid%3D*
Disallow *panelid%3D*
Disallow *tags%3D*
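
To illustrate how the rules in this group might be evaluated, here is a minimal sketch in Python. It assumes the common convention that "*" matches any run of characters, a trailing "$" anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names RULES, pattern_to_regex and is_allowed are hypothetical helpers, not part of any crawler. The sketch compares patterns literally, so "%3D" is not treated as equivalent to "="; real crawlers may normalize percent-encoding differently.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("allow", "/"),
    ("disallow", "*handler%3D*"),
    ("disallow", "*categoryid%3D*"),
    ("disallow", "*panelid%3D*"),
    ("disallow", "*tags%3D*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing "$" anchors the pattern to the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules."""
    best = None
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            # Longer patterns take precedence; Allow wins a length tie.
            key = (len(pattern), kind == "allow")
            if best is None or key > best[0]:
                best = (key, kind)
    # A path that matches no rule is allowed by default.
    return best is None or best[1] == "allow"
```

Under these assumptions, is_allowed("/news/article") is true because only "Allow /" matches, while is_allowed("/search?handler%3Dfoo") is false because the longer "*handler%3D*" pattern takes precedence.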

Other Records

Field Value
sitemap https://www.thenewsmarket.com/sitemap/sitemap.xml
sitemap https://www.thenewsmarket.com/sitemap/news-sitemap.xml
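
As a rough sketch of how the sitemap records above could be pulled out of a robots.txt body, the Python below scans for lines whose field name is "sitemap" (case-insensitive). The ROBOTS_TXT string is a hypothetical reconstruction from the records in this report; the real file's exact layout, casing and comments are not shown in the scan. The function name sitemap_urls is likewise an assumption made for illustration.

```python
# Hypothetical reconstruction of the file from the records listed above;
# the actual file's layout is not shown in the scan data.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: *handler%3D*
Disallow: *categoryid%3D*
Disallow: *panelid%3D*
Disallow: *tags%3D*

Sitemap: https://www.thenewsmarket.com/sitemap/sitemap.xml
Sitemap: https://www.thenewsmarket.com/sitemap/news-sitemap.xml
"""


def sitemap_urls(robots_txt):
    """Collect the values of all sitemap records in a robots.txt body."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        # Split on the first colon only, so the URL's own colons survive.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls
```

Calling sitemap_urls(ROBOTS_TXT) returns the two sitemap URLs in file order. Sitemap records are independent of user-agent groups, which is why the sketch ignores group boundaries.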