soundboardly.com
robots.txt

Robots Exclusion Standard data for soundboardly.com

Resource Scan

Scan Details

Site Domain soundboardly.com
Base Domain soundboardly.com
Scan Status Ok
Last Scan2024-11-06T06:11:34+00:00
Next Scan 2024-11-13T06:11:34+00:00

Last Scan

Scanned2024-11-06T06:11:34+00:00
URL https://soundboardly.com/robots.txt
Domain IPs 104.21.11.30, 172.67.165.24, 2606:4700:3030::6815:b1e, 2606:4700:3036::ac43:a518
Response IP 172.67.165.24
Found Yes
Hash 6bcb694c4d2a3b93e81853042ea2cb76e4371d61d83073731275e22e72509c4d
SimHash 622cdc5a26e0

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /cgi-bin/
Allow /wp-admin/admin-ajax.php
Disallow /refer/
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php

ahrefsbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /*?*
Disallow /page
Disallow /search
Disallow /creator
Allow /*?amp=1$

Other Records

Field Value
sitemap https://www.soundboardly.com/sitemap_index.xml

Comments

  • Allow all search engines to crawl the site
  • Block access to the WordPress admin area and other unnecessary directories
  • Allow access to specific admin-ajax.php file
  • Block referral spam
  • Block common WordPress files that are not useful to be indexed
  • Block AhrefBot
  • Block search and pagination URLs
  • Sitemap directives to help search engines find your sitemaps