← 返回题库
中级

提取GBDT特征重要性前15名

未完成
中级参考 代码结构已给出,请填写 ____ 处
def solve():
    from pyodide.http import open_url
    from io import StringIO
    loans_featured_csv = open_url(____).read()
    import pandas as pd
    from sklearn.ensemble import GradientBoostingClassifier
    from sklearn.model_selection import train_test_split
    from io import StringIO
    df = pd.read_csv(StringIO(____))
    num_cols=[c for c in df.select_dtypes(____).columns if c not in ['____','____'] and not c.endswith(____)]
    X=df[num_cols].fillna(____); y=df['____']
    X_tr,X_te,y_tr,y_te=train_test_split(____)
    gb=GradientBoostingClassifier(____)
    gb.fit(____)
    imp=pd.DataFrame({'____':num_cols,'____':gb.feature_importances_.round(____)})
    print(imp.nlargest(____,'____').to_string(____))

示例

输入
solve()
期望输出
feature  importance
             subGrade      0.2160
       income_to_loan      0.0784
             term_woe      0.0608
          installment      0.0572
         interestRate      0.0532
             acc_util      0.0502
     payment_pressure      0.0436
           issue_year      0.0416
           risk_index      0.0362
                grade      0.0332
credit_history_months      0.0299
            grade_woe      0.0285
                  dti      0.0284
            revolUtil      0.0247
                 term      0.0245
Python 代码 🔒 登录后使用
🔒

登录后即可练习

注册免费账号,在浏览器中直接运行 Python 代码