Skip to content

Running SQL Server 2022 Anywhere

Leverage the power of RustFS to run SQL Server 2022 on any cloud (public, private, or edge) using external table functions and PolyBase.

Any to Any, All the Time

Use SQL Server 2022 data cloud to query and analyze multiple data sources residing on RustFS. Now enterprises can query data residing on RustFS from any SQL Server instance (in public cloud, private cloud, or even streaming edge instances).

Supported Deployment Environments

RustFS integration with SQL Server 2022 supports the following deployment environments:

  • AWS: Amazon Web Services cloud environment
  • GCP: Google Cloud Platform
  • Azure: Microsoft Azure cloud platform
  • Tanzu: VMware Tanzu container platform
  • OpenShift: Red Hat OpenShift container platform
  • HPE Ezmeral: HPE's container platform
  • SUSE Rancher: SUSE's Kubernetes management platform
  • Traditional Bare Metal Deployment: On-premises data center environments

Unified Data Access

Through RustFS's unified S3-compatible interface, SQL Server 2022 can:

  • Access data across multiple cloud environments
  • Eliminate data silos
  • Provide consistent query experience
  • Reduce data integration complexity

Connect to Data, Don't Move It

Using external tables, enterprises can enjoy the full functionality of SQL Server without incurring the cost or coordination challenges of moving data.

PolyBase Feature Advantages

PolyBase functionality allows users to query data directly from SQL Server and most other database installations using Transact-SQL:

Supported Data Sources

  • SQL Server: On-premises and cloud instances
  • Oracle: Enterprise-grade relational database
  • Teradata: Big data analytics platform
  • MongoDB: NoSQL document database
  • S3 API: Access object storage through RustFS

Core Advantages

  1. Zero Data Movement: Direct querying of remote data sources
  2. Unified Query Language: Use familiar T-SQL syntax
  3. Real-time Data Access: No need to pre-load data
  4. Reduced Storage Costs: Avoid duplicate data storage

Data Silo Integration

RustFS provides unique capabilities for accessing all hyperscale cloud environments. The combination of SQL Server 2022 and RustFS enables enterprises to:

  • Access data scattered across different systems
  • Gain comprehensive insights from data silos
  • Achieve unified data view
  • Simplify complex data integration scenarios

Massive Scale Performance

Massive scale performance solutions for all enterprise data.

Performance Characteristics

With this new capability, enterprises can use SQL Server 2022 for all organizational data:

Unlimited Data Scale

  • Location Agnostic: Data can be located anywhere
  • Unlimited Scale: Support for multi-petabyte data storage
  • Fast Queries: High-speed queries for massive datasets
  • Concurrent Processing: Support for multi-user concurrent access

Performance Optimization

With RustFS's industry-leading performance characteristics:

  1. High Throughput: Optimized data transfer speeds
  2. Low Latency: Fast response to query requests
  3. Intelligent Caching: Improve performance for frequently accessed data
  4. Load Balancing: Automatic query load distribution

Resource Utilization Enhancement

This means higher utilization:

  • SQL Server Utilization: More fully utilize existing SQL Server investments
  • RustFS Instance Utilization: Maximize storage resource value
  • Enterprise Data Utilization: Unlock the full value of data

Backup and Recovery

Backup and restore like you've always dreamed of.

Core Use Cases

One of the core use cases for SQL Server 2022 and RustFS is backup and restore:

Diverse Configuration Support

  • Multiple Architectures: Support different deployment architectures
  • Flexible Configuration: Adapt to various business needs
  • Scalability: Scale with business growth

Fast Recovery Capabilities

RustFS's industry-leading throughput characteristics:

  1. Time Compression: Reduce weeks of recovery time to hours
  2. High Availability: Ensure business continuity
  3. Data Integrity: Guarantee backup data integrity
  4. Automated Processes: Reduce manual intervention

Backup Strategy Optimization

Effective backup strategies include:

  • Incremental Backup: Only backup changed data
  • Differential Backup: Changes based on last full backup
  • Full Backup: Regular complete data backup
  • Instant Recovery: Fast recovery of critical business data

Secure and Available

To ensure the right data is available to the right users, fine-grained access control must be implemented on these multi-cloud data lakes.

Identity Authentication and Authorization

Third-Party IDP Integration

RustFS can integrate with third-party identity providers (IDPs):

  • Unified Identity Management: Centralized user identity management
  • Single Sign-On (SSO): Simplified user access experience
  • Multi-Factor Authentication (MFA): Enhanced security
  • Role Mapping: Automatic assignment of appropriate permissions

Access Control Mechanisms

Ensure access to object storage is limited to those who need it:

  1. Principle of Least Privilege: Only grant necessary permissions
  2. Regular Permission Reviews: Ensure permission timeliness
  3. Access Logging: Complete audit trails
  4. Anomaly Detection: Identify abnormal access behavior

Policy-Based Access Control (PBAC)

Fine-Grained Permission Management

RustFS's sophisticated PBAC functionality ensures:

  • Resource-Level Control: Precise permissions to specific resources
  • Dynamic Permission Assignment: Adjust permissions based on context
  • Policy Inheritance: Simplify permission management
  • Compliance Support: Meet regulatory requirements

Security Assurance

  • Data Encryption: Encryption protection during transmission and storage
  • Network Isolation: Secure network communication
  • Threat Detection: Real-time security threat monitoring
  • Incident Response: Rapid response to security incidents

Resilience

SQL Server is one of the most widely used analytics tools in enterprises, making it a mission-critical application.

Disaster Recovery Capabilities

Continuous Data Replication

SQL Server 2022 allows continuous data replication to and from the cloud:

  • Real-time Synchronization: Ensure data is up-to-date
  • Bidirectional Replication: Support active-active deployment
  • Conflict Resolution: Automatically handle data conflicts
  • Failover: Quick switch to backup systems

Tiered Storage Strategy

The combination with RustFS allows:

  1. Fast Storage Tier: NVMe high-speed storage
  2. Warm Storage Tier: Balance performance and cost
  3. Cold Storage Tier: Long-term archival storage
  4. Automatic Tiering: Intelligent data movement

Data Processing Capabilities

Multiple Processing Methods

Enterprises can read, write, and process big data using multiple methods:

  • Transact-SQL: Traditional SQL query language
  • Spark Libraries: Big data processing framework
  • Hybrid Analytics: Combine relational and non-relational data
  • Real-time Processing: Stream data processing capabilities

High Availability Architecture

  • Multi-site Deployment: Cross-regional data distribution
  • Active-Active Replication: Provide highest availability
  • Strict Consistency: Ensure data consistency
  • Cloud Disaster Recovery: Resist complete cloud failures

Streaming Edge

By adding external table functionality, enterprises can now set up streaming pipelines to save data on RustFS - in the cloud or on-premises.

Real-time Data Processing

Streaming Data Pipelines

  • Real-time Data Ingestion: Continuously receive streaming data
  • Data Preprocessing: Clean and transform data
  • Storage Optimization: Efficient data storage
  • Query Optimization: Query optimization for streaming data

Real-time Query Capabilities

SQL Server can be configured to execute queries on this data in real-time:

  1. Eliminate Batch Imports: No need to wait for batch processing
  2. Instant Insights: Real-time business insights
  3. Reduced Latency: Minimize data processing delays
  4. Enhanced Experience: Add new dimensions to SQL Server

Edge Computing Advantages

Edge Deployment Characteristics

  • Low Latency Processing: Process data close to source
  • Bandwidth Optimization: Reduce data transmission
  • Offline Capabilities: Support intermittent connectivity
  • Local Intelligence: Edge intelligent decision-making

Application Scenarios

  • IoT Data Processing: Internet of Things device data
  • Real-time Monitoring: System status monitoring
  • Predictive Maintenance: Equipment failure prediction
  • Smart Manufacturing: Production process optimization

Cloud as Operating Model

Cloud operating model starting from S3.

Cloud Operations Characteristics

RustFS adheres to the cloud operating model:

Core Technology Stack

  • Containerization: Containerized application deployment
  • Orchestration: Kubernetes container orchestration
  • Automation: Automated operations management
  • API-Driven: Complete API interface
  • S3 Compatibility: Standard S3 API support

Unified Interface Advantages

Provides unified interface across clouds and storage types:

  1. Simplified Development: Unified development interface
  2. Reduced Learning Costs: Standardized operation methods
  3. Improved Portability: Cross-cloud application migration
  4. Reduced Lock-in: Avoid vendor lock-in

AI/ML Framework Compatibility

Broad Framework Support

Since most AI/ML frameworks and applications are designed to use S3 API:

  • TensorFlow: Google's machine learning framework
  • PyTorch: Facebook's deep learning framework
  • Scikit-learn: Python machine learning library
  • Apache Spark: Big data processing engine

Developer Validation

With over 1.3 billion Docker pulls:

  • Most Developer Validated: Extensive developer community
  • 24/7/365 Validation: Continuous compatibility validation
  • Best Compatibility: Industry-best compatibility record
  • Production Ready: Large-scale production validated

Data Management Flexibility

This compatibility ensures:

  • AI Workload Access: Seamless access to stored data
  • Cloud Infrastructure Agnostic: Independent of specific cloud environments
  • Flexible Data Approaches: Adapt to different data processing needs
  • Cross-Cloud Environment Processing: Support multi-cloud data processing

Edge AI Storage

At the edge, network latency, data loss, and software bloat degrade performance.

Edge Optimization Features

Performance Advantages

RustFS is the world's fastest object storage:

  • Less than 100 MB: Extremely small binary files
  • Any Hardware: Can be deployed on any hardware
  • High Performance: Optimized edge performance
  • Low Resource Consumption: Minimal system requirements

Intelligent Features

RustFS's advanced features:

  1. Bucket Notifications: Storage bucket event notifications
  2. Object Lambda: Object processing functions
  3. Real-time Inference: Instant data processing
  4. Automatic Triggers: Event-based automatic processing

Edge Application Scenarios

Mission-Critical Applications

  • Airborne Object Detection: High-altitude drone applications
  • Traffic Trajectory Prediction: Autonomous vehicles
  • Industrial Control: Real-time industrial control systems
  • Security Monitoring: Real-time security monitoring

Technical Characteristics

RustFS's AI storage features:

  • Fast Response: Millisecond response times
  • Fault Tolerance: High reliability design
  • Simple Deployment: Simplified deployment process
  • Edge Optimization: Optimized for edge scenarios

Lifecycle Management for ML/AI Workloads

Modern AI/ML workloads require complex lifecycle management.

Automated Data Management

Core Functions

RustFS's lifecycle management capabilities:

  • Automated Tasks: Automatically execute data management tasks
  • Storage Optimization: Optimize storage efficiency
  • Reduced Overhead: Lower operational overhead
  • Intelligent Tiering: Automatic data tiering

Cost Optimization Strategies

With lifecycle policies:

  1. Automatic Migration: Migrate infrequently accessed data to low-cost storage
  2. Resource Release: Free resources for active workloads
  3. Storage Tiering: Multi-tier storage architecture
  4. Cost Control: Effective storage cost control

ML/AI Specialized Functions

Developer Experience

These features ensure AI/ML practitioners can:

  • Focus on Core: Focus on model training and development
  • Automatic Management: RustFS intelligently manages data
  • Performance Enhancement: Improve overall workflow performance
  • Cost Effectiveness: Achieve maximum cost effectiveness

Compliance Support

Lifecycle management layer:

  • Enforce Policies: Enforce retention and deletion policies
  • Regulatory Compliance: Ensure compliance with regulations
  • Audit Trails: Complete operation records
  • Automated Compliance: Automated compliance processes

Object Retention for AI/ML Workflows

Compared to AI/ML, fewer workloads depend more on when things happen.

Advanced Object Retention

Core Guarantees

Addressed through advanced object retention features:

  • Data Integrity: Ensure integrity of stored data
  • Compliance Requirements: Meet regulatory compliance requirements
  • Time Sensitivity: Handle time-related business needs
  • Data Consistency: Maintain data consistency

Retention Policy Implementation

By implementing retention policies, RustFS can help organizations:

  1. Model Consistency: Maintain data consistency for AI/ML models and datasets
  2. Prevent Accidental Deletion: Avoid accidental or unauthorized deletion
  3. Prevent Modification: Protect data from unauthorized modification
  4. Version Control: Maintain data version history

Data Governance Advantages

Governance Framework

This feature is crucial for:

  • Data Governance: Establish comprehensive data governance framework
  • Regulatory Compliance: Meet various regulatory requirements
  • Experiment Reproducibility: Ensure AI/ML experiment reproducibility
  • Data Lineage: Complete data lineage tracking

Guarantee Mechanisms

Guarantee critical data:

  • Specific Duration: Remain accessible for specified time
  • Data Immutability: Ensure data is not modified
  • Precise Training: Support precise model training
  • Reliable Analysis: Provide reliable data analysis foundation

Data Protection for Core AI Datasets

RustFS provides comprehensive data protection through different feature quantities.

Data Redundancy and Fault Tolerance

Protection Mechanisms

  • Erasure Coding: Efficient data redundancy mechanism
  • Site Replication: Cross-site data replication
  • Data Redundancy: Ensure redundant data storage
  • Fault Tolerance: Prevent hardware failures or data corruption

Failure Recovery

Automatically handle various failure scenarios:

  1. Hardware Failures: Automatic detection and recovery
  2. Data Corruption: Real-time detection and repair
  3. Network Failures: Automatic recovery from network interruptions
  4. Site Failures: Cross-site failover

Data Encryption Protection

Encryption Mechanisms

RustFS supports multi-level data encryption:

  • Encryption at Rest: Encryption protection for stored data
  • Encryption in Transit: Encryption during data transmission
  • Key Management: Secure key management mechanisms
  • Compliance Encryption: Encryption standards meeting compliance requirements

Access Control

  • Unauthorized Access Protection: Prevent unauthorized data access
  • Authentication: Enforce authentication mechanisms
  • Permission Control: Fine-grained permission control
  • Access Monitoring: Real-time access behavior monitoring

Identity and Access Management (IAM)

IAM Support

RustFS's IAM support enables organizations to:

  • Access Control: Control access to AI storage data
  • User Management: Unified user management
  • Application Authorization: Application access control
  • Permission Assignment: Flexible permission assignment mechanisms

Security Assurance

Ensure only authorized users or applications can:

  1. Access Data: Secure data access
  2. Modify Data: Controlled data modification
  3. Delete Data: Secure data deletion
  4. Manage Permissions: Permission management operations

Full Lifecycle Protection

Comprehensive Protection Mechanisms

RustFS provides comprehensive data protection mechanisms:

  • Integrity Protection: Maintain AI dataset integrity
  • Availability Assurance: Ensure high data availability
  • Confidentiality Protection: Protect data confidentiality
  • Lifecycle Coverage: Cover entire data lifecycle

Through deep integration of SQL Server 2022 with RustFS, enterprises can build a powerful, secure, high-performance modern data platform supporting comprehensive needs from traditional relational data processing to the latest AI/ML workloads.

Released under the Apache License 2.0.