spectrumincgc.com
robots.txt

Robots Exclusion Standard data for spectrumincgc.com

Resource Scan

Scan Details

Site Domain spectrumincgc.com
Base Domain spectrumincgc.com
Scan Status Ok
Last Scan 2026-01-25T10:52:08+00:00
Next Scan 2026-02-24T10:52:08+00:00

Last Scan

Scanned 2026-01-25T10:52:08+00:00
URL https://spectrumincgc.com/robots.txt
Domain IPs 3.14.105.18
Response IP 3.14.105.18
Found Yes
Hash 9fe3b4f7b1c814a5876f1942d345ea8c2072749615850770e4d8cd286b107f02
SimHash 01645d06f613

Groups

*

Rule Path
Disallow /wp-admin/*
Disallow /wp-login.php
Disallow /wp-includes/*
Disallow /wp-content/*
Disallow /trackback
Disallow /feed
Disallow */comments
Disallow *?replytocom
Disallow */comments-page-*
Disallow */trackback
Disallow */feed
Disallow */comments
Disallow *?s=
Disallow /search/
Allow /wp-content/cache/*
Allow /wp-content/uploads/*
Allow /wp-content/themes/*
Allow /wp-content/plugins/*
Allow /wp-includes/js/*
Allow /wp-includes/css/*
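
Several of the Allow rules above overlap with broader Disallow rules (for example /wp-content/uploads/* sits under /wp-content/*). The sketch below shows one way such overlaps could be resolved, assuming the longest-match precedence described in RFC 9309, where the most specific matching pattern wins and Allow wins a tie. The names RULES, pattern_to_regex, and is_allowed are hypothetical and not part of the scan; individual crawlers may resolve these rules differently.

```python
import re

# Rules copied from the "*" group listed above (the repeated */comments entry is listed once).
RULES = [
    ("disallow", "/wp-admin/*"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-includes/*"),
    ("disallow", "/wp-content/*"),
    ("disallow", "/trackback"),
    ("disallow", "/feed"),
    ("disallow", "*/comments"),
    ("disallow", "*?replytocom"),
    ("disallow", "*/comments-page-*"),
    ("disallow", "*/trackback"),
    ("disallow", "*/feed"),
    ("disallow", "*?s="),
    ("disallow", "/search/"),
    ("allow", "/wp-content/cache/*"),
    ("allow", "/wp-content/uploads/*"),
    ("allow", "/wp-content/themes/*"),
    ("allow", "/wp-content/plugins/*"),
    ("allow", "/wp-includes/js/*"),
    ("allow", "/wp-includes/css/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is treated as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for path in ("/wp-content/uploads/logo.png", "/wp-admin/index.php", "/?s=hello"):
        print(path, is_allowed(path))
```

Under these assumptions, /wp-content/uploads/logo.png is fetchable because the Allow pattern is longer than the /wp-content/* Disallow, while /wp-admin/index.php and /?s=hello are blocked.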

Other Records

Field Value
crawl-delay 10
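
The crawl-delay value is a non-standard directive that some crawlers read as a minimum number of seconds between requests. As a rough illustration, Python's standard urllib.robotparser can read it when it appears inside a user-agent group. The file contents below are a hypothetical reconstruction: the scan does not show where the directive sits in the original file, and the sketch assumes it belongs to the "*" group. The agent name ExampleBot is also made up.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt; assumes crawl-delay belongs to the "*" group.
lines = [
    "User-agent: *",
    "Crawl-delay: 10",
    "Disallow: /wp-login.php",
]

parser = RobotFileParser()
parser.parse(lines)

# Falls back to the "*" group because no group names ExampleBot.
delay = parser.crawl_delay("ExampleBot")
login_ok = parser.can_fetch("ExampleBot", "/wp-login.php")

if __name__ == "__main__":
    print("seconds between requests:", delay)
    print("may fetch /wp-login.php:", login_ok)
```

Note that urllib.robotparser does not interpret the `*` wildcards used in the Groups section, so it is only suitable here for reading the delay and simple prefix rules.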

Comments

  • foxscan:wp-1.0.3