gearepublic.com
robots.txt

Robots Exclusion Standard data for gearepublic.com

Resource Scan

Scan Details

Site Domain gearepublic.com
Base Domain gearepublic.com
Scan Status Ok
Last Scan2025-12-08T16:51:19+00:00
Next Scan 2026-01-07T16:51:19+00:00

Last Scan

Scanned2025-12-08T16:51:19+00:00
URL https://gearepublic.com/robots.txt
Domain IPs 162.241.244.139
Response IP 162.241.244.139
Found Yes
Hash f4bfaa308b126a6d838c40fab3e9235acbf14ce3b40f94fe10567837c1b3f78c
SimHash 69304d4f4812

Groups

*

Rule Path
Disallow
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /readme.html
Disallow /license.txt
Disallow /xmlrpc.php
Disallow /wp-register.php
Disallow */disclaimer/*
Disallow *?attachment_id=

Other Records

Field Value
sitemap https://gearepublic.com/freetexttospeech/sitemap_index.xml