[Feature]: improve system stability #1981

@leonrayang

Description

Is your feature request related to a problem? Please describe.

  • The master leader serves every interface, while followers serve only a few, including client/partitions, because client/partitions generates heavy traffic once the dp count grows large. We use the raft log to pass the cache from the leader to the followers so that followers can answer these requests and reduce the traffic load on the leader.
    However, the cache is too large: when the network is unstable, raft must strictly resend the log, which takes more memory to build and process the network packets. A new channel that does not go through the raft log is therefore needed.
  • Add rate limiting for node decommissioning to enhance system stability.
  • Support for master-slave switch migration state reload to reduce atomicity issues during migration.
  • Increased handling capabilities for scenarios such as disk permissions, bad disks, and missing dp.
  • Improved accuracy of data partition status and space statistics, enhanced stability in dp unloading scenarios.
  • Add whitelist rate limiting for master interfaces and add interface statistics.
  • Reduced memory consumption and mitigated OOM (Out-of-Memory) issues in the master.
  • Resolved data security risks caused by idempotence issues.
  • Enhanced in-memory state persistence and restart loading reliability for MP (Meta Partition) to address replica anomalies caused by snapshot interruptions.
  • Streamlined and optimized state machine processes for DP (Data Partition), MP, and replica, improved display of replica status in CLI (Command Line Interface) tools.
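
To make the first item concrete, here is a minimal Go sketch of one possible shape for a non-raft channel: the leader keeps only the latest versioned snapshot of the client/partitions cache, and a follower fetches it when its own copy is stale. Every name here (`Snapshot`, `Leader`, `Follower`, `Publish`, `Fetch`, `Sync`) is hypothetical and not taken from the codebase, and the network transport is replaced by a direct method call. The point is only that a lagging follower skips straight to the newest version instead of replaying every missed entry in order, as the raft log requires.

```go
package main

import "sync"

// Snapshot is a hypothetical versioned copy of the client/partitions cache.
type Snapshot struct {
	Version uint64
	Data    []byte
}

// Leader keeps only the newest snapshot; older versions are dropped.
type Leader struct {
	mu     sync.RWMutex
	latest Snapshot
}

// Publish stores a private copy of data as the next version and returns it.
func (l *Leader) Publish(data []byte) uint64 {
	l.mu.Lock()
	defer l.mu.Unlock()
	l.latest = Snapshot{
		Version: l.latest.Version + 1,
		Data:    append([]byte(nil), data...),
	}
	return l.latest.Version
}

// Fetch returns the newest snapshot only if it is newer than version have.
func (l *Leader) Fetch(have uint64) (Snapshot, bool) {
	l.mu.RLock()
	defer l.mu.RUnlock()
	if l.latest.Version <= have {
		return Snapshot{}, false
	}
	return l.latest, true
}

// Follower serves reads from its local copy of the cache.
type Follower struct {
	mu    sync.RWMutex
	cache Snapshot
}

// Sync pulls once from the leader and reports whether the cache advanced.
// A real follower would call this over the network, on a timer or on notify.
func (f *Follower) Sync(l *Leader) bool {
	f.mu.RLock()
	have := f.cache.Version
	f.mu.RUnlock()

	snap, ok := l.Fetch(have)
	if !ok {
		return false
	}

	f.mu.Lock()
	defer f.mu.Unlock()
	// Re-check under the write lock so a concurrent Sync cannot move us backwards.
	if snap.Version <= f.cache.Version {
		return false
	}
	f.cache = snap
	return true
}

// Version returns the version of the locally cached snapshot.
func (f *Follower) Version() uint64 {
	f.mu.RLock()
	defer f.mu.RUnlock()
	return f.cache.Version
}

// Data returns the locally cached payload.
func (f *Follower) Data() []byte {
	f.mu.RLock()
	defer f.mu.RUnlock()
	return f.cache.Data
}
```

With this shape, a follower that missed several updates during a network glitch costs the leader one transfer of the current snapshot, and no backlog of unacknowledged entries has to be held in memory. The trade-off is that followers may briefly serve a slightly stale view.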

Labels

enhancement
