thewebplant.com
robots.txt

Robots Exclusion Standard data for thewebplant.com

Resource Scan

Scan Details

Site Domain thewebplant.com
Base Domain thewebplant.com
Scan Status Ok
Last Scan 2025-11-04T04:16:02+00:00
Next Scan 2025-12-04T04:16:02+00:00

Last Scan

Scanned 2025-11-04T04:16:02+00:00
URL https://thewebplant.com/robots.txt
Redirect https://www.thewebplant.com/robots.txt
Redirect Domain www.thewebplant.com
Redirect Base thewebplant.com
Domain IPs 199.60.103.185, 199.60.103.85
Redirect IPs 199.60.103.227, 199.60.103.29, 2606:2c40::c73c:671d, 2606:2c40::c73c:67e3
Response IP 199.60.103.29
Found Yes
Hash d0f1e2364f0e97fc686c87a30440aa8d62b78462de438c6fe089dd0c54288c97
SimHash e045c870c2f2

Groups

*

Rule Path
Disallow /hubspot-themes/
Disallow /page/2/
Disallow /project-planner/
Disallow /home-detail-page
Disallow /home/home-agency/feed/
Disallow /portfolio/portfolio-4-columns/
Disallow /blog/medium-without-sidebar/
Disallow /home/home-onepage/
Disallow /features/header-4/
Disallow /_hcms/forms/
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
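
The last two rules use the `*` wildcard, which plain prefix matching does not handle. The sketch below shows one way such rules could be evaluated against a request path, assuming the matching semantics described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, everything else is a prefix match). The names `DISALLOW_RULES`, `rule_to_regex`, and `is_disallowed` are illustrative and not part of the scan report; the rule list is copied from the table above.

```python
import re

# Disallow rules for the "*" group, copied from the report above.
DISALLOW_RULES = [
    "/hubspot-themes/",
    "/page/2/",
    "/project-planner/",
    "/home-detail-page",
    "/home/home-agency/feed/",
    "/portfolio/portfolio-4-columns/",
    "/blog/medium-without-sidebar/",
    "/home/home-onepage/",
    "/features/header-4/",
    "/_hcms/forms/",
    "/_hcms/preview/",
    "/hs/manage-preferences/",
    "/hs/preferences-center/",
    "/*?*hs_preview=*",
    "/*?*hsCacheBuster=*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex.

    Assumed semantics (RFC 9309): '*' matches any sequence of characters,
    a trailing '$' anchors the end of the URL, otherwise prefix match.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape literal pieces and join them with '.*' where '*' appeared.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow rule matches the path (plus query string).

    The group contains no Allow rules, so any match means "disallowed".
    """
    return any(rule_to_regex(rule).match(path_and_query) for rule in DISALLOW_RULES)


if __name__ == "__main__":
    print(is_disallowed("/hubspot-themes/demo"))       # True
    print(is_disallowed("/blog/post?hs_preview=abc"))  # True
    print(is_disallowed("/blog/"))                     # False
```

Because the group has only Disallow lines, the sketch skips the longest-match tie-breaking that would be needed if Allow rules were present.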

Other Records

Field Value
sitemap https://www.thewebplant.com/sitemap.xml
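
For reference, the records above can be read back with Python's standard `urllib.robotparser`. The snippet is a minimal sketch: `ROBOTS_TXT` is a hand-written reconstruction of part of the file based on this report (the original file's exact text, ordering, and comments are not shown here), and only two prefix rules are included because `urllib.robotparser` does not interpret `*` wildcards.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a partial reconstruction from the report, not the fetched file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /hubspot-themes/
Disallow: /_hcms/preview/
Sitemap: https://www.thewebplant.com/sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() (Python 3.8+) returns the Sitemap values, or None if absent.
sitemaps = parser.site_maps()

# can_fetch() applies simple prefix matching for the given user agent.
blocked = not parser.can_fetch("*", "https://www.thewebplant.com/hubspot-themes/demo")

if __name__ == "__main__":
    print(sitemaps)
    print(blocked)
```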