Fix: Code vulnerabilities and unsafe practices #350

Sambit003 · 2025-08-06T19:56:46Z

Type of Change

Related Issues

Summary of Changes

This pull request introduces a series of safety and validation improvements across several modules, focusing on robust error handling for integer overflows, resource exhaustion, and port validation. The changes are grouped into four main themes: erasure coding arithmetic safety, time/resource exhaustion protection, port validation in service management, and replacing unsafe AtomicPtr usage in concurrent state management.

Arithmetic Safety and Error Handling in Erasure Coding:

Comprehensive use of checked arithmetic (checked_add, checked_sub, checked_mul) and error propagation in Erasure and ShardReader implementations to prevent integer overflows during size calculations, block offsets, and data writes. All critical arithmetic operations now return explicit errors or safe defaults on overflow, improving reliability and maintainability. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15]
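As a rough illustration of the checked-arithmetic pattern described above, here is a minimal sketch. The `SizeError` type and the `block_offset` and `remaining` helpers are hypothetical names chosen for this example; they are not taken from the Erasure or ShardReader code in this PR.

```rust
/// Hypothetical error type for this sketch; the PR's actual error type is not shown here.
#[derive(Debug, PartialEq, Eq)]
pub enum SizeError {
    /// An arithmetic step would have wrapped around; the string names the step.
    Overflow(&'static str),
}

/// Byte offset of a block: start + block_index * block_size.
/// Each step uses checked arithmetic and returns an error instead of wrapping.
pub fn block_offset(block_index: usize, block_size: usize, start: usize) -> Result<usize, SizeError> {
    let scaled = block_index
        .checked_mul(block_size)
        .ok_or(SizeError::Overflow("block_index * block_size"))?;
    scaled
        .checked_add(start)
        .ok_or(SizeError::Overflow("scaled + start"))
}

/// Bytes still to be written: total - consumed.
/// checked_sub turns an underflow (consumed > total) into an explicit error.
pub fn remaining(total: usize, consumed: usize) -> Result<usize, SizeError> {
    total
        .checked_sub(consumed)
        .ok_or(SizeError::Overflow("total - consumed"))
}
```

Callers can propagate these results with `?`, so an overflow surfaces as an error at the call site rather than as a silently wrapped size.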

Protection Against Resource Exhaustion and Unbounded Loops:

Added maximum iteration and time jump limits in LastMinuteLatency to prevent resource exhaustion and unbounded loops, including a new safe version of get_total that returns errors if time jumps are excessive.
Ensured histogram tagging in LastMinuteHistogram does not exceed bounds by clamping the tag index.
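The two protections above might look roughly like the following sketch. `MinuteWindow`, `WindowError`, `clamp_tag`, and the `MAX_TIME_JUMP_SECS` limit are assumptions made for illustration; the real LastMinuteLatency and LastMinuteHistogram types and their limits may differ.

```rust
/// Number of one-second slots in the sliding window.
const WINDOW_SECS: u64 = 60;
/// Assumed upper bound on an acceptable clock jump (one day); the PR's real limit is not shown.
const MAX_TIME_JUMP_SECS: u64 = 86_400;

#[derive(Debug, PartialEq, Eq)]
pub enum WindowError {
    TimeJumpTooLarge { jump_secs: u64 },
}

/// Hypothetical per-second accumulator covering the last minute.
pub struct MinuteWindow {
    totals: [u64; WINDOW_SECS as usize],
    last_sec: u64,
}

impl MinuteWindow {
    pub fn new(now_sec: u64) -> Self {
        Self { totals: [0; WINDOW_SECS as usize], last_sec: now_sec }
    }

    /// Record a value for the current second.
    pub fn add(&mut self, now_sec: u64, value: u64) -> Result<(), WindowError> {
        self.forward_to(now_sec)?;
        let slot = (now_sec % WINDOW_SECS) as usize;
        self.totals[slot] = self.totals[slot].saturating_add(value);
        Ok(())
    }

    /// Sum of the window; returns an error instead of looping on an excessive jump.
    pub fn total(&mut self, now_sec: u64) -> Result<u64, WindowError> {
        self.forward_to(now_sec)?;
        Ok(self.totals.iter().fold(0u64, |acc, v| acc.saturating_add(*v)))
    }

    /// Expire stale slots. The loop runs at most WINDOW_SECS - 1 times.
    fn forward_to(&mut self, now_sec: u64) -> Result<(), WindowError> {
        // A clock that moved backwards yields a jump of 0 and expires nothing.
        let jump = now_sec.saturating_sub(self.last_sec);
        if jump == 0 {
            return Ok(());
        }
        if jump > MAX_TIME_JUMP_SECS {
            return Err(WindowError::TimeJumpTooLarge { jump_secs: jump });
        }
        if jump >= WINDOW_SECS {
            // Every slot is stale: reset in one step rather than iterating per second.
            self.totals = [0; WINDOW_SECS as usize];
        } else {
            for step in 1..=jump {
                let slot = ((self.last_sec + step) % WINDOW_SECS) as usize;
                self.totals[slot] = 0;
            }
        }
        self.last_sec = now_sec;
        Ok(())
    }
}

/// Clamp a histogram tag index into 0..bucket_count so indexing cannot go out of bounds.
pub fn clamp_tag(index: usize, bucket_count: usize) -> usize {
    index.min(bucket_count.saturating_sub(1))
}
```

The design choice in this sketch is to bound the work per call by the window size and to reject, rather than silently absorb, a jump larger than the assumed limit.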

Port Validation and Error Messaging in Service Management:

Refactored port extraction and validation in ServiceManager to include explicit range checks, reserved port warnings, and unified parsing/validation logic. Service start and restart methods now validate ports before proceeding, with improved error messaging. [1] [2] [3] [4]
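A unified parse-and-validate step along these lines could be sketched as follows. `PortError`, `ValidPort`, and `parse_port` are hypothetical names for this example, and treating ports below 1024 as the "reserved" range that triggers a warning is an assumption; the ServiceManager code in this PR may draw the line elsewhere.

```rust
/// Hypothetical error type for this sketch.
#[derive(Debug, PartialEq, Eq)]
pub enum PortError {
    /// The address contains no ':' separator.
    Missing,
    /// The text after the last ':' is not an unsigned number.
    NotANumber(String),
    /// The number is 0 or larger than 65535.
    OutOfRange(u32),
}

/// A validated port plus a flag the caller can use to emit a warning.
#[derive(Debug, PartialEq, Eq)]
pub struct ValidPort {
    pub port: u16,
    /// Assumption: ports below 1024 count as reserved/privileged.
    pub privileged: bool,
}

/// Extract and validate the port from an address such as "127.0.0.1:9000" or ":9000".
pub fn parse_port(address: &str) -> Result<ValidPort, PortError> {
    // Split on the last ':' so bracketed IPv6 hosts like "[::1]:9000" also work.
    let (_, port_str) = address.rsplit_once(':').ok_or(PortError::Missing)?;
    // Parse into a wider type first so values such as 70000 get a range error.
    let raw: u32 = port_str
        .parse()
        .map_err(|_| PortError::NotANumber(port_str.to_string()))?;
    if raw == 0 || raw > u32::from(u16::MAX) {
        return Err(PortError::OutOfRange(raw));
    }
    Ok(ValidPort { port: raw as u16, privileged: raw < 1024 })
}
```

Start and restart paths could then call one function like this before doing any work, and report the specific `PortError` variant in the error message.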

Refactoring for Code Safety:

Replaced unsafe AtomicPtr usage with the safer Arc<RwLock> primitive for concurrent state management in both the QueryStateMachine and the metadata Cache. This improves memory safety and reduces the risk of race conditions.

These changes collectively enhance the system's robustness against common runtime errors and improve operational safety.
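For the AtomicPtr to Arc<RwLock> refactor, the general shape of the pattern is sketched below. `QueryState` and `StateMachine` are simplified, hypothetical stand-ins, not the actual QueryStateMachine or Cache types touched by this PR.

```rust
use std::sync::{Arc, RwLock};

/// Simplified, hypothetical state enum for this sketch.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum QueryState {
    Accepting,
    Running,
    Done,
}

/// Shared state behind Arc<RwLock<..>> instead of a raw AtomicPtr:
/// no unsafe dereference and no manual deallocation of the old value.
#[derive(Clone)]
pub struct StateMachine {
    state: Arc<RwLock<QueryState>>,
}

impl StateMachine {
    pub fn new() -> Self {
        Self { state: Arc::new(RwLock::new(QueryState::Accepting)) }
    }

    /// Read the current state; many readers may hold the lock at once.
    pub fn state(&self) -> QueryState {
        // If another thread panicked while holding the lock, keep using the inner value.
        let guard = self.state.read().unwrap_or_else(|e| e.into_inner());
        *guard
    }

    /// Replace the state under the write lock and return the previous one.
    pub fn transition(&self, next: QueryState) -> QueryState {
        let mut guard = self.state.write().unwrap_or_else(|e| e.into_inner());
        std::mem::replace(&mut *guard, next)
    }
}
```

Cloning the `StateMachine` only clones the `Arc`, so every clone observes the same state, and the value is freed automatically when the last clone is dropped.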

Checklist

I have read and followed the CONTRIBUTING.md guidelines
Code is formatted with cargo fmt --all
Passed cargo clippy --all-targets --all-features -- -D warnings
Passed cargo check --all-targets
Added/updated necessary tests
Documentation updated (if needed)
CI/CD passed (if applicable)

Impact

Breaking change (compatibility)
Requires doc/config/deployment update
Other impact:

Additional Notes

Thank you for your contribution! Please ensure your PR follows the community standards (CODE_OF_CONDUCT.md) and sign the CLA if this is your first contribution.

CLAassistant · 2025-08-06T19:56:52Z

All committers have signed the CLA.

Sambit003 · 2025-08-07T03:22:31Z

I don't know why the e2e test is failing; locally, all the tests passed. Maybe it's some kind of initialization issue in the CI/CD workflow. Please check on that and let me know.
@loverustfs

loverustfs · 2025-08-07T05:39:07Z

@Sambit003

It looks like the unit test failed.


Backtrace:
  2025-08-07T01:42:46.837516Z  INFO s3s_test::runner: Test case end, summary: FnSummary { result: Err("Failed: service error"), duration_ns: 2809447, duration_ms: 2.809447 }
    at crates/s3s-test/src/runner.rs:177
    in s3s_test::runner::run_case with name: "test_assume_role"
    in s3s_test::runner::run_fixture with name: "STS"
    in s3s_test::runner::run_suite with name: "Advanced"

  2025-08-07T01:42:46.837532Z  INFO s3s_test::runner: Test fixture teardown
    at crates/s3s-test/src/runner.rs:149
    in s3s_test::runner::run_fixture with name: "STS"
    in s3s_test::runner::run_suite with name: "Advanced"

  2025-08-07T01:42:46.837581Z  INFO s3s_test::runner: Test fixture end, duration_ns: 2942778, case_count: CountSummary { total: 1, passed: 0, failed: 1 }
    at crates/s3s-test/src/runner.rs:157
    in s3s_test::runner::run_fixture with name: "STS"
    in s3s_test::runner::run_suite with name: "Advanced"

  2025-08-07T01:42:46.837696Z  INFO s3s_test::runner: Test suite end, duration_ns: 3770750, fixture_count: CountSummary { total: 1, passed: 0, failed: 1 }
    at crates/s3s-test/src/runner.rs:114
    in s3s_test::runner::run_suite with name: "Advanced"

  2025-08-07T01:42:46.837700Z  INFO s3s_test::runner: Test end, duration_ns: 7290559793, suite_count: CountSummary { total: 2, passed: 0, failed: 2 }
    at crates/s3s-test/src/runner.rs:76

FAILED 1763.527ms [Basic/Essential/test_list_buckets]
  ERROR Failed: service error
FAILED 2498.021ms [Basic/Essential/test_list_objects]
  ERROR Failed: service error
FAILED 1369.927ms [Basic/Essential/test_get_object]
  ERROR Failed: service error
FAILED 5631.614ms [Basic/Essential]
FAILED 1612.854ms [Basic/Put]
FAILED 7286.738ms [Basic]
FAILED    2.848ms [Advanced/STS/test_assume_role]
  ERROR Failed: service error
FAILED    2.943ms [Advanced/STS]
FAILED    3.771ms [Advanced]
FAILED 7290.560ms
Error: Process completed with exit code 1.

Sambit003 · 2025-08-07T06:01:33Z

Yes, I've been digging into the issue and I'm fixing the failures.

loverustfs · 2025-08-07T09:34:59Z

Thank you for your contribution!

Sambit003 added 8 commits August 4, 2025 23:41

refactor: replace AtomicPtr with Arc<RwLock> in Cache struct for safer concurrency

ce05400

refactor: replace AtomicPtr with Arc<RwLock> for state management in QueryStateMachine

1736988

fix: prevent integer overflow in size calculations across Erasure methods

cd5630c

refactor: enhance port extraction and validation in ServiceManager

d773293

fix: add protection against unbounded loops and excessive time jumps in LastMinuteLatency

5126ad7

Formatted

5bc3057

fix: improve error messages for port validation in ServiceManager

2f572aa

feat: enhance credential validation with comprehensive checks for access and secret keys

5bceace
