soapcentral.com
robots.txt

Robots Exclusion Standard data for soapcentral.com

Resource Scan

Scan Details

Site Domain soapcentral.com
Base Domain soapcentral.com
Scan Status Ok
Last Scan 2024-11-13T18:35:12+00:00
Next Scan 2024-11-20T18:35:12+00:00

Last Scan

Scanned 2024-11-13T18:35:12+00:00
URL https://soapcentral.com/robots.txt
Redirect https://www.soapcentral.com/robots.txt
Redirect Domain www.soapcentral.com
Redirect Base soapcentral.com
Domain IPs 104.18.0.78, 104.18.1.78, 2606:4700::6812:14e, 2606:4700::6812:4e
Redirect IPs 104.18.0.78, 104.18.1.78, 2606:4700::6812:14e, 2606:4700::6812:4e
Response IP 104.18.1.78
Found Yes
Hash 7ba054cccd72f81350e844a82a8cbc93f0ac1c1386c3362449b642634453fa9e
SimHash 2000400509b3

Groups

*

Rule Path
Allow /ads.txt
Disallow /_admin/
Disallow /dan/
Disallow /soapcentral/admin/
Disallow /soapcentral/content/
Disallow /templates/
Disallow /soapcentral/content/_old
Disallow /*.inc$
Disallow /*?printonly=yes
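
The rules above can be checked against a path with a small matcher. The following is a minimal sketch, assuming the common robots.txt matching conventions (`*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the scan data. Python's standard `urllib.robotparser` is not used here because it does not interpret the `*` and `$` wildcards.

```python
import re

# Rules copied from the "*" group listed above, as (directive, pattern) pairs.
RULES = [
    ("allow", "/ads.txt"),
    ("disallow", "/_admin/"),
    ("disallow", "/dan/"),
    ("disallow", "/soapcentral/admin/"),
    ("disallow", "/soapcentral/content/"),
    ("disallow", "/templates/"),
    ("disallow", "/soapcentral/content/_old"),
    ("disallow", "/*.inc$"),
    ("disallow", "/*?printonly=yes"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be fetched under the given rules."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            # Longest pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `is_allowed("/ads.txt")` is true, while `/templates/x.html`, `/page.inc`, and `/news/story.php?printonly=yes` are all disallowed. A path such as `/page.inc.php` stays allowed because the `$` in `/*.inc$` requires the path to end in `.inc`.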

Other Records

Field Value
sitemap https://www.soapcentral.com/sitemap.xml
sitemap https://www.soapcentral.com/sitemap.html
sitemap https://www.soapcentral.com/sitemap_images.xml
sitemap https://www.soapcentral.com/sitemap_video.xml
sitemap https://www.soapcentral.com/feed_rss.xml
sitemap https://static.soapcentral.com/api_generated/news-sitemap-en.xml
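
The sitemap records above can be read back out of a robots.txt body with Python's standard library. This is a minimal sketch: the `ROBOTS_LINES` list is a reconstruction from the records listed in this report (the raw file is not reproduced here, so its exact spelling and ordering are assumptions), and `list_sitemaps` is a hypothetical helper name. It relies on `RobotFileParser.site_maps()`, which is available in Python 3.8 and later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" table; not the verbatim file.
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /ads.txt",
    "Sitemap: https://www.soapcentral.com/sitemap.xml",
    "Sitemap: https://www.soapcentral.com/sitemap.html",
    "Sitemap: https://www.soapcentral.com/sitemap_images.xml",
    "Sitemap: https://www.soapcentral.com/sitemap_video.xml",
    "Sitemap: https://www.soapcentral.com/feed_rss.xml",
    "Sitemap: https://static.soapcentral.com/api_generated/news-sitemap-en.xml",
]


def list_sitemaps(lines):
    """Return the sitemap URLs declared in robots.txt lines, in file order."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap records are present.
    return parser.site_maps() or []


SITEMAPS = list_sitemaps(ROBOTS_LINES)
```

Sitemap records are independent of the user-agent groups, so they are returned regardless of which group precedes them.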

Comments

  • Disallow: /ads/