feat: Optimize DynamoDB online store for improved latency #5889

ntkathole · 2026-01-22T08:52:50Z

What this PR does / why we need it:

Optimizes the DynamoDB online store implementation to reduce online feature serving latency.

O(1) dictionary lookup instead of O(n log n) sorting - _process_batch_get_response now orders results with one dictionary lookup per requested entity (O(n) overall) instead of sorting the response.
Cached TypeDeserializer - Added class-level cached TypeDeserializer instance to avoid per-request object instantiation overhead in async reads.
Entity ID computation caching - _to_entity_ids now caches computed entity IDs within a request to avoid redundant hashing for duplicate entity keys.
VPC endpoint support for async client - The endpoint_url config is now properly passed to the async aiobotocore client, enabling DynamoDB VPC endpoints for reduced network latency.
Improved default configuration values:
- batch_size: 40 → 100 (max allowed by DynamoDB BatchGetItem)
- max_pool_connections: 10 → 50 (better concurrency)
- keepalive_timeout: 12s → 30s (better connection reuse)
- connect_timeout: 60s → 5s (faster failure detection)
- read_timeout: 60s → 10s (faster failure detection)
- total_max_retry_attempts: None → 3 (bounded retries)
- retry_mode: None → "adaptive" (smart retry with rate limiting)
Pre-allocated result lists - Response processing now pre-allocates result lists instead of using append-based growth.
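
For reviewers who want a picture of the ordering, caching, and pre-allocation changes listed above, here is a minimal sketch. It is not the code in this PR: the function names order_by_lookup and to_entity_ids_cached, the "entity_id" field, and the use of plain hashable keys are all assumptions made for illustration.

```python
from typing import Callable, Dict, Hashable, List, Optional, Sequence


def to_entity_ids_cached(
    entity_keys: Sequence[Hashable],
    compute_id: Callable[[Hashable], str],
) -> List[str]:
    """Compute one ID per key, hashing each distinct key only once per request.

    Assumption: keys are hashable; a real entity key might need to be
    serialized first before it can be used as a dictionary key.
    """
    cache: Dict[Hashable, str] = {}
    ids: List[str] = []
    for key in entity_keys:
        if key not in cache:
            cache[key] = compute_id(key)  # the expensive step runs once per distinct key
        ids.append(cache[key])
    return ids


def order_by_lookup(
    requested_ids: Sequence[str],
    items: Sequence[dict],
    id_field: str = "entity_id",  # hypothetical field name
) -> List[Optional[dict]]:
    """Align unordered response items with the request order without sorting."""
    # Build the index once: O(n) total, O(1) per lookup afterwards.
    by_id = {item[id_field]: item for item in items}
    # Pre-allocate the result so each slot is written exactly once.
    result: List[Optional[dict]] = [None] * len(requested_ids)
    for position, entity_id in enumerate(requested_ids):
        result[position] = by_id.get(entity_id)  # stays None when the item is missing
    return result
```

Entities missing from the response stay as None in their original position, and duplicate entity keys in a request resolve to the same item without a second hash computation.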
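
To make the new defaults above easier to reason about, the sketch below collects them in one place and shows how they might map onto client settings. The DynamoDBTuning class and both helper functions are hypothetical and are not part of this PR. The keyword names in the returned dict follow botocore's Config options (max_pool_connections, connect_timeout, read_timeout, retries). keepalive_timeout is left out because it is assumed here to be applied at the async HTTP connector level rather than through that object.

```python
from dataclasses import dataclass
from typing import Any, Dict, Iterator, List, Optional, Sequence


@dataclass(frozen=True)
class DynamoDBTuning:
    """Hypothetical container holding the new defaults from the PR description."""

    batch_size: int = 100  # BatchGetItem accepts at most 100 items per call
    max_pool_connections: int = 50
    keepalive_timeout: float = 30.0  # seconds
    connect_timeout: float = 5.0  # seconds
    read_timeout: float = 10.0  # seconds
    total_max_retry_attempts: Optional[int] = 3
    retry_mode: Optional[str] = "adaptive"


def client_config_kwargs(tuning: DynamoDBTuning) -> Dict[str, Any]:
    """Build keyword arguments in the shape botocore's Config accepts."""
    kwargs: Dict[str, Any] = {
        "max_pool_connections": tuning.max_pool_connections,
        "connect_timeout": tuning.connect_timeout,
        "read_timeout": tuning.read_timeout,
    }
    retries: Dict[str, Any] = {}
    if tuning.total_max_retry_attempts is not None:
        retries["total_max_attempts"] = tuning.total_max_retry_attempts
    if tuning.retry_mode is not None:
        retries["mode"] = tuning.retry_mode
    if retries:  # omit the key entirely when both retry settings are unset
        kwargs["retries"] = retries
    return kwargs


def chunked(ids: Sequence[str], size: int) -> Iterator[List[str]]:
    """Split entity IDs into batches no larger than `size`."""
    for start in range(0, len(ids), size):
        yield list(ids[start:start + size])
```

With these defaults, a request for 250 entities would be split into batches of 100, 100, and 50, and the returned dict could be passed on as botocore.config.Config(**client_config_kwargs(DynamoDBTuning())).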

Signed-off-by: ntkathole <nikhilkathole2683@gmail.com>

ntkathole self-assigned this Jan 22, 2026

feat: Optimize DynamoDB online store for improved latency

275eac2

Signed-off-by: ntkathole <nikhilkathole2683@gmail.com>

ntkathole force-pushed the perf_improvements branch from 64342e4 to 275eac2 January 22, 2026 09:09