← 返回题库
初级

统计各字段缺失率并按缺失率降序排列

未完成
初级参考 完整示例代码供参考,建议自己理解后重新输入
def solve():
    from pyodide.http import open_url
    from io import StringIO
    loans_raw_csv = open_url("https://data.zuihe.com/dbd/riskctrl/state_00/loans_raw.csv").read()
    import pandas as pd
    from io import StringIO
    df = pd.read_csv(StringIO(loans_raw_csv))
    miss = df.isnull().mean().round(4).reset_index()
    miss.columns = ['column','missing_rate']
    miss = miss[miss['missing_rate']>0].sort_values('missing_rate',ascending=False)
    print(miss.to_string(index=False))

示例

输入
solve()
期望输出
column  missing_rate
               n11        0.0860
  employmentLength        0.0587
                n0        0.0510
               n13        0.0510
               n12        0.0510
                n2        0.0510
                n1        0.0510
                n6        0.0510
                n5        0.0510
                n7        0.0510
                n3        0.0510
               n14        0.0510
                n8        0.0510
                n9        0.0510
                n4        0.0421
               n10        0.0421
         revolUtil        0.0004
pubRecBankruptcies        0.0003
               dti        0.0001
Python 代码 🔒 登录后使用
🔒

登录后即可练习

注册免费账号,在浏览器中直接运行 Python 代码