newschannel.com
robots.txt

Robots Exclusion Standard data for newschannel.com

Resource Scan

Scan Details

Site Domain newschannel.com
Base Domain newschannel.com
Scan Status Ok
Last Scan2024-09-26T18:00:57+00:00
Next Scan 2024-10-03T18:00:57+00:00

Last Scan

Scanned2024-09-26T18:00:57+00:00
URL https://newschannel.com/robots.txt
Redirect https://www.newschannel.com/robots.txt
Redirect Domain www.newschannel.com
Redirect Base newschannel.com
Domain IPs 142.132.210.97
Redirect IPs 142.132.210.97
Response IP 142.132.210.97
Found Yes
Hash 986b158e7f73364eaa3346f94d28e06f87e7d17e003ee99e82d0c2cff1af002d
SimHash 1b4d0902dd90

Groups

mozilla/5.0 (compatible; mj12bot/v1.4.8; http://mj12bot.com/)

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

mozilla/5.0 (compatible; adsbot/3.1; +https://seostar.co/robot/)

Rule Path
Disallow /

mozilla/5.0 (compatible; seokicks; +https://www.seokicks.de/robot.html)

Rule Path
Disallow /

mozilla/5.0 (linux; android 7.0;) applewebkit/537.36 (khtml, like gecko) mobile safari/537.36 (compatible; petalbot;+https://webmaster.petalsearch.com/site/petalbot)

Rule Path
Disallow /

mozilla/5.0 (compatible;petalbot;+https://webmaster.petalsearch.com/site/petalbot)

Rule Path
Disallow /

mozilla/5.0 (compatible; infotigerbot/1.9; +https://infotiger.com/bot)

Rule Path
Disallow /

*

Rule Path
Disallow

Other Records

Field Value
sitemap http://newschannel.com/sitemap-index.xml