hearstctads.com
robots.txt

Robots Exclusion Standard data for hearstctads.com

Resource Scan

Scan Details

Site Domain hearstctads.com
Base Domain hearstctads.com
Scan Status Ok
Last Scan 2024-09-09T21:25:46+00:00
Next Scan 2024-10-09T21:25:46+00:00

Last Scan

Scanned 2024-09-09T21:25:46+00:00
URL https://hearstctads.com/robots.txt
Domain IPs 174.34.58.123
Response IP 174.34.58.123
Found Yes
Hash 0957f65cd92ae748d793e9c7f2323ae3f7458da37a09baf93e781ec4f00045fe
SimHash b29e0115e975
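
A recorded content hash like the one above can be used to tell whether robots.txt has changed since the scan. The following is a minimal sketch under a stated assumption: the Hash value is 64 hex digits, which is the length of a SHA-256 digest, but the scan does not name the algorithm or say which bytes were hashed. The helper names `content_digest` and `has_changed` are hypothetical.

```python
import hashlib


def content_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of the raw robots.txt bytes.

    Assumption: the scan's Hash field is a SHA-256 digest of the response
    body; the report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded_hash: str) -> bool:
    """Compare a freshly fetched body against a previously recorded digest."""
    return content_digest(body) != recorded_hash.lower()
```

To use it, fetch https://hearstctads.com/robots.txt, pass the raw response bytes as `body`, and pass the Hash value from the table as `recorded_hash`. A mismatch could mean the file changed, or simply that the scanner hashed the content differently.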

Groups

*

Rule Path
Disallow /connecticut-adportal/flow.html
Disallow /connecticut-adportal/sflow.html
Disallow /connecticut-adportal/usernamereminder.html
Disallow /connecticut-adportal/passwordreminder.html
Disallow /connecticut-adportal/passwordreset.html
Disallow /connecticut-adportal/adduser.html
Disallow /connecticut-adportal/home/
Disallow /connecticut-adportal/admin/
Disallow /connecticut-adportal/j_acegi_security_check
Disallow /*flow.html
Disallow /*sflow.html
Disallow /*usernamereminder.html
Disallow /*passwordreminder.html
Disallow /*passwordreset.html
Disallow /*adduser.html
Disallow /*j_acegi_security_check
Disallow /*viewFile.html
Disallow /connecticut-adportal-test/
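
Several of the paths above use `*`, which under the Robots Exclusion Protocol (RFC 9309) matches any sequence of characters; rules otherwise match as path prefixes. The sketch below shows one way a crawler might evaluate a few of these rules. The function names are hypothetical, and it skips Allow precedence because this group contains only Disallow rules.

```python
import re

# A subset of the Disallow paths listed for the "*" group.
DISALLOW = [
    "/connecticut-adportal/home/",
    "/connecticut-adportal/admin/",
    "/*passwordreset.html",
    "/*viewFile.html",
    "/connecticut-adportal-test/",
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    "*" becomes ".*"; a trailing "$" anchors the end of the path.
    Everything else is matched literally, as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """True if any Disallow rule matches the path from its start."""
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)
```

With these rules, `/connecticut-adportal/admin/users` and `/any/dir/passwordreset.html` are blocked, while `/index.html` is not.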

Other Records

Field Value
crawl-delay 20
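
The crawl-delay value asks compliant crawlers to wait 20 seconds between requests; it is a non-standard field that not every crawler honors. As a rough sketch, Python's standard `urllib.robotparser` can read it. The fragment below is a hypothetical reconstruction from this scan, not the original file: the real file's ordering and casing are unknown. It uses only plain prefix rules because `urllib.robotparser` does not interpret `*` inside paths. `ExampleBot` is a made-up user agent.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of part of the scanned file.
ROBOTS_FRAGMENT = """\
User-agent: *
Disallow: /connecticut-adportal/admin/
Crawl-delay: 20

User-agent: semrushbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

# "ExampleBot" has no group of its own, so it falls back to the "*" group.
delay = parser.crawl_delay("ExampleBot")
admin_ok = parser.can_fetch(
    "ExampleBot", "https://hearstctads.com/connecticut-adportal/admin/"
)
home_ok = parser.can_fetch("ExampleBot", "https://hearstctads.com/")

# semrushbot has its own group with "Disallow: /", so nothing is fetchable.
semrush_ok = parser.can_fetch("semrushbot", "https://hearstctads.com/")
```

A polite crawler would sleep for `delay` seconds between requests, for example with `time.sleep(delay)`.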

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

lexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

gethpinfo.com-bot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

veooz

Rule Path
Disallow /

wikido

Rule Path
Disallow /

yeti

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

anybot

Rule Path
Disallow /

aaabot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

gnowitnewsbot

Rule Path
Disallow /

re-re studio (+http://vip0.ru/)

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /