inithtml.com
robots.txt

Robots Exclusion Standard data for inithtml.com

Resource Scan

Scan Details

Site Domain inithtml.com
Base Domain inithtml.com
Scan Status Ok
Last Scan 2025-07-20T03:16:53+00:00
Next Scan 2025-07-27T03:16:53+00:00

Last Scan

Scanned 2025-07-20T03:16:53+00:00
URL https://inithtml.com/robots.txt
Domain IPs 104.21.7.98, 172.67.130.24, 2606:4700:3031::ac43:8218, 2606:4700:3036::6815:762
Response IP 104.21.7.98
Found Yes
Hash 5d9af9ba795d530086f076038d3a344894e3ebf74f87643ddd87c40dda4adf63
SimHash c9264d44e9b2
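
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below assumes SHA-256 over the raw response body; the function name content_hash is hypothetical. It shows one way to compare a freshly fetched robots.txt against a recorded digest.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a response body.

    Assumption: the scan's "Hash" field is SHA-256 of the raw bytes.
    The report itself does not say which algorithm was used.
    """
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded_hex: str) -> bool:
    """Compare a body's digest with a recorded hex digest, ignoring case."""
    return content_hash(body) == recorded_hex.strip().lower()
```

If the assumption holds, passing the bytes fetched from the URL above to matches_recorded together with the Hash field would indicate whether the file has changed since the scan.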

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /*?s=
Disallow /*?replytocom=
Disallow /*?amp
Disallow /*?noamp
Disallow /amp/
Disallow /wp-json/
Disallow /?rest_route=
Allow /wp-admin/admin-ajax.php
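
The rules in this group can be evaluated as in the sketch below. It is a minimal matcher, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie) and treating `*` as a wildcard. The names RULES, pattern_to_regex, and is_allowed are hypothetical, not part of the scan. Python's standard urllib.robotparser is not used here because it does not interpret `*` inside paths.

```python
import re

# Rules copied from the "*" group above, as (directive, path) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/*?s="),
    ("disallow", "/*?replytocom="),
    ("disallow", "/*?amp"),
    ("disallow", "/*?noamp"),
    ("disallow", "/amp/"),
    ("disallow", "/wp-json/"),
    ("disallow", "/?rest_route="),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled.

    Assumption: longest matching pattern wins, Allow beats Disallow on a
    tie, and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, is_allowed("/wp-admin/admin-ajax.php") is True because the Allow path is longer than the /wp-admin/ Disallow, while is_allowed("/wp-admin/options.php") and is_allowed("/?s=test") are both False.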

Other Records

Field Value
sitemap https://inithtml.com/sitemap.xml