cefaluweb.com
robots.txt

Robots Exclusion Standard data for cefaluweb.com

Resource Scan

Scan Details

Site Domain cefaluweb.com
Base Domain cefaluweb.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-09-01T17:39:02+00:00
Next Scan 2024-10-31T17:39:02+00:00

Last Successful Scan

Scanned 2024-07-04T17:38:03+00:00
URL https://cefaluweb.com/robots.txt
Domain IPs 104.21.44.76, 172.67.197.75, 2606:4700:3031::6815:2c4c, 2606:4700:3035::ac43:c54b
Response IP 172.67.197.75
Found Yes
Hash 10cbff3172331f5cd4800df937ef9c0a55eeb316ad0a878f39a5f0387ed3ba3c
SimHash 6a3ec8d27499

Groups

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

bingbot

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /cgi-bin/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /trackback/
Disallow /feed/
Disallow /comments/
Disallow /author/
Disallow /category/
Disallow /tag/
Disallow /?s=
Disallow /*/feed/
Disallow /*/trackback/
Disallow /*/comments/
Disallow /*.php$
Disallow /?attachment_id=
Disallow /*?replytocom
Allow /wp-content/uploads/
Allow /wp-content/themes/*/assets/
Allow /wp-content/plugins/*/assets/

Other Records

Field Value
crawl-delay 10
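
The `*` group above mixes broad Disallow rules with narrower Allow exceptions, so the outcome for a given path depends on which rule takes precedence. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). Individual crawlers may evaluate rules differently. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and only the rules of this first `*` group are included.

```python
import re

# Rules copied from the first "*" group in this report.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/wp-content/cache/"),
    ("disallow", "/wp-content/themes/"),
    ("disallow", "/trackback/"),
    ("disallow", "/feed/"),
    ("disallow", "/comments/"),
    ("disallow", "/author/"),
    ("disallow", "/category/"),
    ("disallow", "/tag/"),
    ("disallow", "/?s="),
    ("disallow", "/*/feed/"),
    ("disallow", "/*/trackback/"),
    ("disallow", "/*/comments/"),
    ("disallow", "/*.php$"),
    ("disallow", "/?attachment_id="),
    ("disallow", "/*?replytocom"),
    ("allow", "/wp-content/uploads/"),
    ("allow", "/wp-content/themes/*/assets/"),
    ("allow", "/wp-content/plugins/*/assets/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern ('*' wildcard, '$' end anchor) to a regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be fetched under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:  # an empty pattern matches nothing
            continue
        if pattern_to_regex(pattern).match(url_path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_prefers_allow = len(pattern) == best_len and allow
            if longer or tie_prefers_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


print(is_allowed("/wp-content/themes/foo/assets/style.css"))
print(is_allowed("/wp-content/themes/foo/functions.php"))
```

Under these assumptions, the theme asset path is allowed because the Allow pattern is longer than `/wp-content/themes/`, while the theme PHP file stays blocked.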

googlebot-news

Rule Path
Allow /category/
Allow /tag/

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

*

Rule Path
Disallow /*blackhole
Disallow /?blackhole

*

Rule Path
Disallow /wp-content/uploads/wpo-plugins-tables-list.json

*

Rule Path
Disallow
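
Several user agents in this report (`googlebot-news`, `googlebot-image`, and `*`) appear in more than one group. RFC 9309 says groups that name the same user agent are combined into one. The sketch below illustrates that merging on a subset of the groups listed here; the long first `*` group is left out for brevity. The names `GROUPS`, `merge_groups`, and `rules_for` are hypothetical, and matching on the exact lowercase token is a simplifying assumption.

```python
# A subset of the groups in this report, in file order.
# An empty path ("") stands for a bare "Disallow" line, which matches nothing.
GROUPS = [
    ("googlebot-news", [("allow", "/")]),
    ("googlebot-image", [("allow", "/")]),
    ("googlebot-news", [("allow", "/category/"), ("allow", "/tag/")]),
    ("semrushbot", [("disallow", "/")]),
    ("googlebot-image", [("disallow", "")]),
    ("*", [("disallow", "/*blackhole"), ("disallow", "/?blackhole")]),
    ("*", [("disallow", "/wp-content/uploads/wpo-plugins-tables-list.json")]),
    ("*", [("disallow", "")]),
]


def merge_groups(groups):
    """Combine the rules of all groups that name the same user agent."""
    merged = {}
    for agent, rules in groups:
        merged.setdefault(agent.lower(), []).extend(rules)
    return merged


def rules_for(user_agent, groups=GROUPS):
    """Return the merged rules for user_agent, falling back to the "*" group."""
    merged = merge_groups(groups)
    return merged.get(user_agent.lower(), merged.get("*", []))


for rule in rules_for("Googlebot-News"):
    print(rule)
```

With this subset, `Googlebot-News` ends up with three Allow rules taken from its two groups, and an agent with no group of its own receives the combined `*` rules.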

Other Records

Field Value
sitemap https://cefaluweb.com/sitemap-news.xml
sitemap https://cefaluweb.com/sitemap.xml

Comments

  • XML Sitemap & Google News version 5.3.6 - https://status301.net/wordpress-plugins/xml-sitemap-feed/
  • Allow full access to important bots
  • Block unnecessary and duplicate content
  • Specific exclusions for efficiency
  • Allow essential scripts and styles
  • Delay requests to reduce server load
  • Allow Google News to access specific directories
  • Block specific bots known for high server load
  • Enhanced directives for Google
  • Block blackhole traps
  • Block JSON data file
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK