I'm trying to convert deeply nested JSON into a pandas DataFrame

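Converting deeply nested JSON into a pandas DataFrame is a common task, especially when working with complex data structures. The basic concepts and steps below walk through the conversion.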

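Basic concepts

JSON (JavaScript Object Notation): a lightweight data-interchange format that is easy for humans to read and write and easy for machines to parse and generate.
pandas: a Python data analysis library that provides high-performance, easy-to-use data structures and analysis tools.
DataFrame: a two-dimensional, tabular data structure in pandas, similar to an Excel sheet or a SQL table.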

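Types and use cases

Types: a JSON value can be an object, array, string, number, boolean, or null. Objects and arrays can contain other values, which is where the nesting comes from.
Use cases: data analysis, machine-learning preprocessing, processing API responses, and so on.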
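
To make the type list concrete, here is a small sketch of how Python's standard json module maps each JSON type to a Python type when parsing. The sample string and its key names are made up for illustration.

```python
import json

# Hypothetical sample covering all six JSON value types
text = '{"obj": {"k": 1}, "arr": [1, 2], "s": "x", "n": 1.5, "b": true, "z": null}'

parsed = json.loads(text)

# Record the Python type that each JSON value was parsed into
python_types = {key: type(value).__name__ for key, value in parsed.items()}

for key, type_name in python_types.items():
    print(key, "->", type_name)
```

Objects become dicts and arrays become lists, which is why flattening nested JSON in Python comes down to walking nested dicts and lists.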

Example code

Suppose you have the following deeply nested JSON data:

{
    "name": "John",
    "age": 30,
    "address": {
        "street": "123 Main St",
        "city": "Anytown",
        "zipcode": "12345"
    },
    "contacts": [
        {
            "type": "email",
            "value": "john@example.com"
        },
        {
            "type": "phone",
            "value": "555-1234"
        }
    ]
}

You can convert it to a pandas DataFrame with the following Python code:

import pandas as pd
import json

# Sample JSON data
data = {
    "name": "John",
    "age": 30,
    "address": {
        "street": "123 Main St",
        "city": "Anytown",
        "zipcode": "12345"
    },
    "contacts": [
        {
            "type": "email",
            "value": "john@example.com"
        },
        {
            "type": "phone",
            "value": "555-1234"
        }
    ]
}

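# `data` is already a Python dict, so this round trip is a no-op here;
# with a raw JSON string you would call json.loads(text) directly
data_dict = json.loads(json.dumps(data))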

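# Flatten nested dicts and lists into a single-level dict with dotted keys
def flatten_json(y):
    out = {}

    def flatten(x, name=''):
        if isinstance(x, dict):
            # Recurse into each key, extending the dotted prefix
            for key, value in x.items():
                flatten(value, name + key + '.')
        elif isinstance(x, list):
            # Use the list index as the path component
            for i, item in enumerate(x):
                flatten(item, name + str(i) + '.')
        else:
            # Leaf value: drop the trailing '.' from the accumulated prefix
            out[name[:-1]] = x

    flatten(y)
    return out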

flattened_data = flatten_json(data_dict)

# Convert to a pandas DataFrame
df = pd.DataFrame([flattened_data])

print(df)

Output

   name  age address.street address.city address.zipcode contacts.0.type contacts.0.value contacts.1.type contacts.1.value
0  John   30    123 Main St      Anytown         12345            email  john@example.com           phone        555-1234
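
As an alternative to the hand-written flattener, pandas also provides pd.json_normalize, which flattens nested dicts on its own. The sketch below is not part of the approach above. It assumes the same sample data and shows two options: flattening to a single row, and expanding the contacts list into one row per contact. The choice of meta fields is only an illustration.

```python
import pandas as pd

data = {
    "name": "John",
    "age": 30,
    "address": {"street": "123 Main St", "city": "Anytown", "zipcode": "12345"},
    "contacts": [
        {"type": "email", "value": "john@example.com"},
        {"type": "phone", "value": "555-1234"},
    ],
}

# Option 1: one row; nested dicts become dotted columns,
# but the "contacts" list stays as a single list-valued cell
flat = pd.json_normalize(data)

# Option 2: one row per contact; the meta fields are repeated on every row.
# A nested meta field is given as a path list, e.g. ["address", "city"].
contacts = pd.json_normalize(
    data,
    record_path="contacts",
    meta=["name", "age", ["address", "city"]],
)

print(flat.columns.tolist())
print(contacts)
```

Option 2 yields a long table (one row per list element) instead of the wide contacts.0.type, contacts.1.type layout, which tends to be easier to filter and aggregate when the list length varies between records.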