colatv.info
robots.txt

Robots Exclusion Standard data for colatv.info

Resource Scan

Scan Details

Site Domain colatv.info
Base Domain colatv.info
Scan Status Ok
Last Scan2026-01-25T23:37:03+00:00
Next Scan 2026-02-24T23:37:03+00:00

Last Scan

Scanned2026-01-25T23:37:03+00:00
URL https://colatv.info/robots.txt
Redirect https://www.lisjobnet.com/robots.txt
Redirect Domain www.lisjobnet.com
Redirect Base lisjobnet.com
Domain IPs 104.21.5.107, 172.67.133.85, 2606:4700:3035::ac43:8555, 2606:4700:3036::6815:56b
Redirect IPs 104.21.74.75, 172.67.200.114, 2606:4700:3035::ac43:c872, 2606:4700:3037::6815:4a4b
Response IP 172.67.200.114
Found Yes
Hash 62f4f54184e919afa12adaeccdbedadd4ce8a86a349ba776127c2b54ea8ffddd
SimHash eb4ff8eaea33

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-content/cache/

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webdownloader

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offlineexplorer

Rule Path
Disallow /

httrack

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

httpweazel

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.lisjobnet.com/sitemap.xml