Show HN: S3HyperSync – Faster S3 sync tool – iterating with up to 100k files/s
S3HyperSync is a GitHub tool for efficient file synchronization between S3-compatible services. It optimizes performance, memory, and costs, ideal for large backups. Features fast speeds, UUID Booster, and installation via JAR file or sbt assembly. Visit GitHub for details.
Read original articleS3HyperSync is a tool available on GitHub designed for efficient file synchronization between S3-compatible storage services. It is optimized for high performance, memory efficiency, and cost-effectiveness. The tool is particularly useful for creating daily backups of large S3 buckets with millions of files and terabytes of data to a separate AWS account. S3HyperSync reduces the need for expensive GetObject requests and minimizes costly MultiPart uploads. It offers fast iteration and copy speeds on AWS Fargate, along with a UUID Booster feature for quick bucket comparisons. Users can install the tool by downloading the JAR file from the Release Section or building it with sbt assembly. S3HyperSync provides various options for syncing S3 buckets efficiently. For more information on installation, usage guidelines, contributing, licensing, and acknowledgments, visit the S3HyperSync GitHub Repository.
Related
Show HN: High-frequency trading and market-making backtesting tool with examples
The GitHub URL leads to the "HftBacktest" project, a Rust framework for high-frequency trading. It offers detailed simulation, order book reconstruction, latency considerations, multi-asset backtesting, and live trading bot deployment.
Show HN: Synapse – TypeScript Toolchain for Cloud Apps
Synapse is a full-stack TypeScript toolchain with resource-driven programming, cloud-agnostic libraries, and fine-grained permissions. It includes a TypeScript compiler, fast package manager, and testing framework for local or AWS deployment. Installation instructions vary by OS. Leveraging TypeScript, esbuild, Node.js, Terraform, and AWS SDK. Detailed documentation on GitHub covers Custom Resources, Environments, Packages, and Tests, with a Quick Start guide available.
Resilient Sync for Local First
The "Local-First" concept emphasizes empowering users with data on their devices, using Resilient Sync for offline and online data exchange. It ensures consistency, security, and efficient synchronization, distinguishing content changes and optimizing processes. The method offers flexibility, conflict-free updates, and compliance documentation, with potential enhancements for data size, compression, and security.
Show HN: Standard Webhooks – simplifying 3rd party API's
Syncd simplifies webhook integrations by offering real-time data connectivity. Users can tunnel webhooks to different endpoints, test locally, and manage incoming data efficiently. The platform streamlines API integration, providing features like logging, debugging, and local testing. Join the waitlist for early access.
Combine multiple RSS feeds into a single feed, as a service
The GitHub URL provides details on "RSS Combine," a tool merging multiple RSS feeds. It guides users on local setup, configuration via YAML or environment variables, and generating a static RSS file in S3. Simplifies feed consolidation.
> For uploads, s5cmd is 32x faster than s3cmd and 12x faster than aws-cli. For downloads, s5cmd can saturate a 40Gbps link (~4.3 GB/s), whereas s3cmd and aws-cli can only reach 85 MB/s and 375 MB/s respectively.
Also, would this work well when there is not a lot of room on the disk it is syncing from? I have had serious issues with the S3 cli in such a scenario?
Also, how would this compare to something like rclone?
Seemingly not the intended use case, and I might be overlooking something, but nice to have features which the s3 sync tool has and I'd personally miss: - profiles - local sync
Related
Show HN: High-frequency trading and market-making backtesting tool with examples
The GitHub URL leads to the "HftBacktest" project, a Rust framework for high-frequency trading. It offers detailed simulation, order book reconstruction, latency considerations, multi-asset backtesting, and live trading bot deployment.
Show HN: Synapse – TypeScript Toolchain for Cloud Apps
Synapse is a full-stack TypeScript toolchain with resource-driven programming, cloud-agnostic libraries, and fine-grained permissions. It includes a TypeScript compiler, fast package manager, and testing framework for local or AWS deployment. Installation instructions vary by OS. Leveraging TypeScript, esbuild, Node.js, Terraform, and AWS SDK. Detailed documentation on GitHub covers Custom Resources, Environments, Packages, and Tests, with a Quick Start guide available.
Resilient Sync for Local First
The "Local-First" concept emphasizes empowering users with data on their devices, using Resilient Sync for offline and online data exchange. It ensures consistency, security, and efficient synchronization, distinguishing content changes and optimizing processes. The method offers flexibility, conflict-free updates, and compliance documentation, with potential enhancements for data size, compression, and security.
Show HN: Standard Webhooks – simplifying 3rd party API's
Syncd simplifies webhook integrations by offering real-time data connectivity. Users can tunnel webhooks to different endpoints, test locally, and manage incoming data efficiently. The platform streamlines API integration, providing features like logging, debugging, and local testing. Join the waitlist for early access.
Combine multiple RSS feeds into a single feed, as a service
The GitHub URL provides details on "RSS Combine," a tool merging multiple RSS feeds. It guides users on local setup, configuration via YAML or environment variables, and generating a static RSS file in S3. Simplifies feed consolidation.