capitalprof.site
robots.txt

Robots Exclusion Standard data for capitalprof.site

Resource Scan

Scan Details

Site Domain capitalprof.site
Base Domain capitalprof.site
Scan Status Ok
Last Scan2025-10-06T07:31:42+00:00
Next Scan 2025-11-05T07:31:42+00:00

Last Scan

Scanned2025-10-06T07:31:42+00:00
URL https://capitalprof.site/robots.txt
Domain IPs 104.21.5.217, 172.67.133.224, 2606:4700:3031::6815:5d9, 2606:4700:3035::ac43:85e0
Response IP 104.21.5.217
Found Yes
Hash 7bd401276e955cb0d3a0201a96000849ccb1016445b200b1d9751f45a52745bf
SimHash 6904e7144311

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-*
Disallow /*?*
Allow */uploads
Allow /images
Allow /wp-*/*.js
Allow /wp-*/*.css
Allow /wp-*/*.jpg
Allow /wp-*/*.png
Allow /wp-*/*.svg
Allow /wp-*/*.ttf
Allow /wp-*/*.gif
Allow /wp-*/*.webp
Allow /wp-*/*.woff2

Other Records

Field Value
sitemap https://capitalprof.site/sitemap_index.xml