startingtoknow.com
robots.txt

Robots Exclusion Standard data for startingtoknow.com

Resource Scan

Scan Details

Site Domain startingtoknow.com
Base Domain startingtoknow.com
Scan Status Ok
Last Scan 2026-02-04T19:35:31+00:00
Next Scan 2026-02-11T19:35:31+00:00

Last Scan

Scanned 2026-02-04T19:35:31+00:00
URL https://startingtoknow.com/robots.txt
Domain IPs 35.212.66.223
Response IP 35.212.66.223
Found Yes
Hash d382b3de26df6bf302b91461d5b1773601c1cd88e19c72dfd73d1b751322651a
SimHash 71d099f71cf4
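
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. As a rough sketch under that assumption, a digest of a fetched robots.txt body could be computed as follows. The function name `body_digest` is hypothetical, and the result will only match the reported value if the scanner really hashes the raw response body with SHA-256.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw bytes of robots.txt with
    SHA-256. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Illustrative input only, not the real file content.
    sample = b"User-agent: *\nDisallow: /cgi-bin\n"
    print(body_digest(sample))
```

Comparing digests from two scans is a cheap way to detect that the file changed at all. A similarity hash such as the SimHash field is usually meant to indicate how much it changed, which a cryptographic digest cannot do.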

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin/
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow /*?printpdf=
Disallow /author/
Disallow */embed
Disallow */page/
Disallow */xmlrpc.php
Disallow *utm%3D
Disallow *openstat%3D
Allow /wp-/ajax.php
Allow */wp-sitemap
Allow */uploads
Allow /wp-/*.js
Allow /wp-/*.css
Allow /wp-/*.png
Allow /wp-/*.jpg
Allow /wp-/*.jpeg
Allow /wp-/*.gif
Allow /wp-/*.svg
Allow /wp-/*.webp
Allow /wp-/*.pdf
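
To illustrate how a crawler might evaluate the rules listed above, here is a minimal sketch of wildcard matching with longest-match precedence, the behavior described in RFC 9309 and followed by major crawlers. It assumes that `*` matches any run of characters, that a trailing `$` anchors the end of the path, that the longest matching pattern wins, and that Allow wins a tie. The names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical, and only a subset of the rules is copied in. Real crawlers also normalize percent-encoding, which this sketch skips.

```python
import re

# Subset of the "*" group rules from the table above, copied verbatim.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-admin/"),
    ("disallow", "*?s="),
    ("disallow", "/search"),
    ("disallow", "/author/"),
    ("disallow", "*/embed"),
    ("disallow", "*/page/"),
    ("allow", "*/wp-sitemap"),
    ("allow", "*/uploads"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern decides; Allow wins ties; a path
    that matches no rule is allowed by default.
    """
    best_len = -1
    verdict = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix rule.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                verdict = allow
    return verdict


if __name__ == "__main__":
    for p in ["/wp-admin/options.php", "/wp-content/uploads/a.jpg",
              "/?s=robots", "/blog/page/2/", "/about/"]:
        print(p, is_allowed(p))
```

Using the pattern length as the measure of specificity is a simplification. It is enough to show why a path matching only `*/uploads` stays crawlable while internal search results (`*?s=`) and paginated archives (`*/page/`) are excluded.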

Other Records

Field Value
sitemap https://startingtoknow.com/sitemap_index.xml