4html.net
robots.txt

Robots Exclusion Standard data for 4html.net

Resource Scan

Scan Details

Site Domain 4html.net
Base Domain 4html.net
Scan Status Ok
Last Scan 2024-09-28T09:05:36+00:00
Next Scan 2024-10-05T09:05:36+00:00

Last Scan

Scanned 2024-09-28T09:05:36+00:00
URL https://4html.net/robots.txt
Domain IPs 172.66.40.96, 172.66.43.160, 2606:4700:3108::ac42:2860, 2606:4700:3108::ac42:2ba0
Response IP 172.66.43.160
Found Yes
Hash 3c877d8b9a711748ecaed6f82cb8cefba83e89795183ce5482525af2a886f63b
SimHash c8030bc04760
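
The Hash value above is 64 hexadecimal characters long, which is consistent with a 256-bit digest such as SHA-256. The scan report does not say which algorithm produced it, so the sketch below is only an assumption about how such a content fingerprint could be computed to detect changes between scans. The function name content_hash is hypothetical.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 is used here because it yields 64 hex characters,
    like the Hash field in the report. The scanner's real algorithm is
    not documented in the source.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Any change to the file body, even a single byte, changes the digest.
    print(content_hash(b"User-agent: *\nAllow: /\n"))
```

Comparing the digest of a freshly fetched file against the stored one is a cheap way to tell whether the rules need to be parsed again.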

Groups

*

Rule Path
Disallow /about%3Ablank
Disallow /storage/*
Disallow /source/*
Disallow /Online-Text-Editor-*
Disallow /Text-Editor-*
Disallow *.pdf
Disallow *.docx
Disallow *.xlsx
Allow /
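
The rules listed above can be evaluated roughly as follows. This is a minimal sketch of the longest-match convention described in RFC 9309, where the most specific matching pattern wins and Allow wins a tie. It assumes "*" matches any run of characters and a trailing "$" anchors the end of the path, and it skips percent-encoding normalization for brevity. The names RULES, pattern_to_regex, and is_allowed are hypothetical and not part of the scan data.

```python
import re

# Rules copied from the "*" group above, in the same order.
RULES = [
    ("disallow", "/about%3Ablank"),
    ("disallow", "/storage/*"),
    ("disallow", "/source/*"),
    ("disallow", "/Online-Text-Editor-*"),
    ("disallow", "/Text-Editor-*"),
    ("disallow", "*.pdf"),
    ("disallow", "*.docx"),
    ("disallow", "*.xlsx"),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any character sequence.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as the rules require.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and allow):
                best_len, best_allow = length, allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/about.html", "/storage/a.txt", "/docs/manual.pdf"]:
        print(p, is_allowed(p))
```

Under these assumptions, the final Allow / rule only takes effect for paths that no longer Disallow pattern matches, since its pattern is a single character long.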