[{"_1":2,"_26":-5,"_27":-5},"loaderData",{"_3":4,"_7":8},"root",{"_5":6},"canonical","https://blog.hushukang.com/en/blog/01946e9f-756b-76d1-9a11-f15e7349dd2f/","routes/$lang.blog.$id",{"_9":10},"article",{"_11":12,"_13":14,"_15":16,"_17":18,"_19":20},"id","01946e9f-756b-76d1-9a11-f15e7349dd2f","publishTime",["D",1737076012844],"title","Amazon DynamoDB: A Comprehensive Guide to Core Concepts","content","Amazon DynamoDB is a serverless NoSQL database that delivers high performance and scalability for applications of all sizes. This article explains the core concepts of DynamoDB in detail, including primary key design, data types, table definitions, CRUD operations, transactions, Global Secondary Indexes (GSI), and best practices for table design.\r\n\r\n---\r\n\r\n## 1. Primary Key\r\n\r\n### Structure of Primary Keys\r\n\r\nIn DynamoDB, every table requires a primary key. There are two types of primary keys:\r\n\r\n1. **Partition Key (PK)**: A single attribute key that determines which partition the data is stored in.\r\n2. **Partition Key + Sort Key (SK)**: A composite key consisting of two attributes. The partition key determines the partition, and the sort key specifies the order within the partition.\r\n\r\nIn both cases, the primary key must ensure **uniqueness**.\r\n\r\nExample:\r\n\r\n```json\r\n{\r\n \"PK\": \"USER#12345\",\r\n \"SK\": \"USER_INFO\"\r\n}\r\n```\r\n\r\n### Recommended Primary Key Design\r\n\r\nThe **Partition Key + Sort Key** format is recommended. For example, when storing user data, set the PK to `USER#` and the SK to `USER_INFO`.\r\n\r\n1. The first part of the PK represents a constant indicating the data type (e.g., `USER`), and the second part represents the user ID.\r\n2. The SK represents the type or context of the data using constants or dynamic values.\r\n\r\nThis design ensures uniqueness and enables efficient querying.\r\n\r\n---\r\n\r\n## 2. Data Types\r\n\r\nDynamoDB supports the following data types:\r\n\r\n- **Scalar types**: String (S), Number (N), Binary (B), Boolean (BOOL)\r\n- **Document types**: Map (M), List (L)\r\n- **Set types**: String Set (SS), Number Set (NS), Binary Set (BS)\r\n\r\n**Notes:**\r\n\r\n- **Flexibility**: Document types like Map and List are ideal for storing nested data.\r\n- **Indexing**: Only scalar types can be used as partition keys or sort keys.\r\n\r\nExample:\r\n\r\n```json\r\n{\r\n \"PK\": \"USER#12345\",\r\n \"SK\": \"USER_INFO\",\r\n \"Name\": \"Alice\",\r\n \"Age\": 30,\r\n \"Preferences\": {\r\n \"Language\": \"English\",\r\n \"TimeZone\": \"UTC+9\"\r\n },\r\n \"Tags\": [\"Developer\", \"Writer\"]\r\n}\r\n```\r\n\r\n---\r\n\r\n## 3. Table Definition\r\n\r\nDynamoDB tables can be defined using CloudFormation or AWS CDK.\r\n\r\n### CloudFormation Example:\r\n\r\n```yaml\r\nResources:\r\n :\r\n Type: AWS::DynamoDB:Table\r\n Properties:\r\n TableName: \r\n AttributeDefinitions:\r\n - AttributeName: \r\n AttributeType: \r\n - AttributeName: \r\n AttributeType: \r\n KeySchema:\r\n - AttributeName: \r\n AttributeType: HASH\r\n - AttributeName: \r\n AttributeType: RANGE\r\n BillingMode: PAY_PER_REQUEST\r\n```\r\n\r\n### CDK Example:\r\n\r\n```typescript\r\nimport * as dynamodb from 'aws-cdk-lib/aws-dynamodb';\r\n\r\nconst table = new dynamodb.Table(this, '', {\r\n tableName: '',\r\n partitionKey: { name: '', type: dynamodb.AttributeType. },\r\n sortKey: { name: '', type: dynamodb.AttributeType.STRING. },\r\n billingMode: dynamodb.BillingMode.PAY_PER_REQUEST,\r\n});\r\n```\r\n\r\n**Parameter Explanation:**\r\n\r\n- ``: Resource name in CloudFormation/CDK.\r\n- ``: DynamoDB table name.\r\n- ``: Name of the partition key.\r\n- ``: Type of the partition key.\r\n- ``: Name of the sort key.\r\n- ``: Type of the sort key.\r\n\r\n---\r\n\r\n## 4. Database Operations\r\n\r\n### Initialization\r\n\r\nInstall the required dependencies:\r\n\r\n```text\r\nnpm install @aws-sdk/client-dynamodb @aws-sdk/lib-dynamodb\r\n```\r\n\r\nInitialize the DynamoDB client:\r\n\r\n```typescript\r\n// dynamodb.util.ts\r\nimport { DynamoDB } from '@aws-sdk/client-dynamodb';\r\nimport { DynamoDBDocumentClient } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst dbClient = new DynamoDB({});\r\n\r\nconst marshallOptions = {\r\n convertEmptyValues: false, // Default: false\r\n removeUndefinedValues: false, // Default: false\r\n convertClassInstanceToMap: false, // Default: false\r\n};\r\n\r\nconst unmarshallOptions = {\r\n wrapNumbers: false, // Default: false\r\n};\r\n\r\nconst translateConfig = { marshallOptions, unmarshallOptions };\r\n\r\nconst docClient = DynamoDBDocumentClient.from(dbClient, translateConfig);\r\n\r\nexport { docClient };\r\n```\r\n\r\n### Key Operations\r\n\r\n#### Add Data\r\n\r\nAdd a new user to the `user_table`:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { PutCommand } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new PutCommand({\r\n TableName: 'user_table',\r\n Item: {\r\n pk: 'USER#12345',\r\n sk: 'USER_INFO',\r\n name: 'Alice',\r\n age: 30,\r\n },\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n#### Delete Data\r\n\r\nDelete a user with ID **12345**:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { DeleteCommand } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new DeleteCommand({\r\n TableName: 'user_table',\r\n Key: {\r\n pk: 'USER#12345',\r\n sk: 'USER_INFO',\r\n },\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n#### Update Data\r\n\r\nUpdate the `name` attribute of a user with ID **12345**:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { UpdateCommand } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new UpdateCommand({\r\n TableName: 'user_table',\r\n Key: {\r\n pk: 'USER#12345',\r\n sk: 'USER_INFO',\r\n },\r\n UpdateExpression: 'set #name = :name',\r\n ExpressionAttributeNames: {\r\n '#name': 'name',\r\n },\r\n ExpressionAttributeValues: {\r\n ':name': 'Bob',\r\n },\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n#### Retrieve Single Item\r\n\r\nRetrieve a user with ID **12345**:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { GetCommand } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new GetCommand({\r\n TableName: 'user_table',\r\n Key: {\r\n pk: 'USER#12345',\r\n sk: 'USER_INFO',\r\n },\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n#### Retrieve Multiple Items\r\n\r\nRetrieve users whose `age` is between 20 and 30:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { ScanCommand } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new ScanCommand({\r\n TableName: 'user_table',\r\n FilterExpression: '#sk = :sk and #age between :start and :end',\r\n ExpressionAttributeNames: {\r\n '#sk': 'sk',\r\n '#age': 'age',\r\n },\r\n ExpressionAttributeValues: {\r\n ':sk': 'USER',\r\n ':start': 20,\r\n ':end': 30,\r\n },\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\nNote: Scans are less efficient than using indexes. Use **GSI (Global Secondary Index)** whenever possible.\r\n\r\n#### Simultaneous updates to multiple data using transactions\r\n\r\nDynamoDB supports ACID transactions, allowing multiple operations to succeed or fail together.\r\n\r\nThe following example updates the name attributes of two users at the same time:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { TransactWriteCommand } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new TransactWriteCommand({\r\n TransactItems: [\r\n {\r\n Update: {\r\n TableName: 'user_table',\r\n Key: {\r\n pk: 'USER#12345',\r\n sk: 'USER_INFO',\r\n },\r\n UpdateExpression: 'set #name = :name',\r\n ExpressionAttributeNames: {\r\n '#name': 'name',\r\n },\r\n ExpressionAttributeValues: {\r\n ':name': 'Bob',\r\n },\r\n },\r\n },\r\n {\r\n Update: {\r\n TableName: 'user_table',\r\n Key: {\r\n pk: 'USER#56789',\r\n sk: 'USER_INFO',\r\n },\r\n UpdateExpression: 'set #name = :name',\r\n ExpressionAttributeNames: {\r\n '#name': 'name',\r\n },\r\n ExpressionAttributeValues: {\r\n ':name': 'Lisa',\r\n },\r\n },\r\n },\r\n ],\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n---\r\n\r\n## 5. Global Secondary Index (GSI)\r\n\r\n### Overview of GSI\r\n\r\nA global secondary index (GSI) allows you to execute queries using a key that is different from the existing partition key or sort key of the table. This improves query performance.\r\n\r\nFor example, when retrieving users with age between 20 and 30 in user_table, you can retrieve data efficiently by utilizing a GSI without performing a full table scan.\r\n\r\n### Defining a GSI\r\n\r\nExample using CloudFormation:\r\n\r\n```yaml\r\nResources:\r\n WorkTable:\r\n Type: AWS::DynamoDB::Table\r\n Properties:\r\n TableName: user_table\r\n AttributeDefinitions:\r\n - AttributeName: pk\r\n AttributeType: S\r\n - AttributeName: sk\r\n AttributeType: S\r\n - AttributeName: age # add age attribute\r\n AttributeType: N # type: number\r\n KeySchema:\r\n - AttributeName: pk\r\n KeyType: HASH\r\n - AttributeName: sk\r\n KeyType: RANGE\r\n BillingMode: PAY_PER_REQUEST\r\n GlobalSecondaryIndexes:\r\n - IndexName: UserAgeIndex # GSI name\r\n KeySchema:\r\n - AttributeName: sk # GSI pk\r\n KeyType: HASH\r\n - AttributeName: age # GSI sk\r\n KeyType: RANGE\r\n Projection:\r\n ProjectionType: ALL\r\n```\r\n\r\n### Query Example Using GSI\r\n\r\nHere's an example of running a query using the above GSI (`UserAgeIndex`):\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { QueryCommand } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new QueryCommand({\r\n TableName: 'user_table',\r\n IndexName: 'UserAgeIndex',\r\n KeyConditionExpression: '#sk = :sk and #age between :start and :end',\r\n ExpressionAttributeNames: {\r\n '#sk': 'sk',\r\n '#age': 'age',\r\n },\r\n ExpressionAttributeValues: {\r\n ':sk': 'USER',\r\n ':start': 20,\r\n ':end': 30,\r\n },\r\n ScanIndexForward: false, // false: descending order, true: ascending order (default is true)\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n---\r\n\r\n## 6. Best Practices for Table Design\r\n\r\nThe design of DynamoDB differs from relational databases, as it focuses on optimizing access patterns. Below is an example:\r\n\r\n**Data Structure:**\r\n\r\n- **Departments**: Department ID, Department Name\r\n- **Employees**: Employee ID, Employee Name, Email\r\n- **Attendance Records**: Date, Check-in Time, Check-out Time, Break Time\r\n\r\n**Search Requirements**:\r\n\r\n- Retrieve a list of departments.\r\n- Retrieve employee information by Employee ID.\r\n- Retrieve employees belonging to a specific department using Department ID.\r\n- Retrieve monthly attendance records of an employee using Employee ID and month/year.\r\n\r\nIn relational databases, the design would look like this:\r\n\r\n![rds](/article-assets/01946e9f-756b-76d1-9a11-f15e7349dd2f/rds.svg)\r\n\r\nFor DynamoDB, the design would look like this:\r\n\r\n![dynamodb](/article-assets/01946e9f-756b-76d1-9a11-f15e7349dd2f/dynamodb.svg)\r\n\r\n### Retrieve a List of Departments\r\n\r\nTo retrieve all department entries from the `EmployeeTable`, use a query where the primary key `pk` is set to `DEPARTMENT`:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { QueryCommand } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new QueryCommand({\r\n TableName: 'EmployeeTable',\r\n KeyConditionExpression: 'pk = :pk',\r\n ExpressionAttributeValues: {\r\n ':pk': 'DEPARTMENT',\r\n },\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n### Retrieve Employee Information by ID\r\n\r\nTo retrieve specific employee information, set `pk` to `Employee#` and `sk` to `INFO`:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { GetCommand } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new GetCommand({\r\n TableName: 'EmployeeTable',\r\n Key: {\r\n pk: 'Employee#',\r\n sk: 'INFO',\r\n },\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n### Retrieve Employees by Department ID\r\n\r\nUsing a GSI (`DepartmentIndex`), retrieve a list of employees belonging to a specific department by querying the departmentId:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { QueryCommandInput } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new QueryCommandInput({\r\n TableName: 'EmployeeTable',\r\n IndexName: 'DepartmentIndex',\r\n KeyConditionExpression: 'departmentId = :departmentId',\r\n ExpressionAttributeValues: {\r\n ':departmentId': '',\r\n },\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n### Retrieve Monthly Attendance Records of an Employee\r\n\r\nFilter attendance records by setting `pk` to `Employee#` and using a prefix `WORK#` for the `sk`:\r\n\r\n```typescript\r\nimport { docClient } from './dynamodb.util';\r\nimport { QueryCommandInput } from '@aws-sdk/lib-dynamodb';\r\n\r\nconst command = new QueryCommandInput({\r\n TableName: 'EmployeeTable',\r\n KeyConditionExpression: 'pk = :pk and begins_with(sk, :sk)',\r\n ExpressionAttributeValues: {\r\n ':pk': 'Employee#',\r\n ':sk': 'WORK#202501', // Attendance records for January 2025\r\n },\r\n});\r\nconst result = await docClient.send(command);\r\n```\r\n\r\n---\r\n\r\n### Best Practices\r\n- **Define Access Patterns Clearly**: DynamoDB design revolves around \"how data will be accessed.\"\r\n- **Single-Table Design**: Store different types of data in one table using partition and sort keys.\r\n- **Use Indexes**: Leverage GSI and LSI for flexible queries.\r\n- **Minimize Scans**: Use keys or indexes to query data efficiently.\r\n- **Utilize Transactions**: Ensure data consistency with DynamoDB's ACID transactions.\r\n\r\n---\r\n\r\nWith these principles, you can maximize DynamoDB's potential for scalable and efficient data operations. Let me know if you'd like to explore further details or specific use cases!\r\n","tags",[21],{"_22":23,"_24":25},"name","AWS","color","#FF9900","actionData","errors"]