Beginner

Compare skewness before and after the log transform

Beginner reference: the complete sample code is provided below. Try to understand it first, then retype it yourself.
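def solve():
    from io import StringIO

    import pandas as pd
    from pyodide.http import open_url

    # Fetch the exercise datasets. Only loans_featured.csv is used below;
    # the other two are downloaded but not read by this solution.
    loans_clean_csv = open_url("https://data.zuihe.com/dbd/riskctrl/state_03/loans_clean.csv").read()
    loans_featured_csv = open_url("https://data.zuihe.com/dbd/riskctrl/state_03/loans_featured.csv").read()
    iv_table_csv = open_url("https://data.zuihe.com/dbd/riskctrl/state_03/iv_table.csv").read()

    # Parse the feature table, which holds both the raw and the log_* columns.
    df = pd.read_csv(StringIO(loans_featured_csv))

    # Summary statistics of the log-transformed income column.
    print(df['log_income'].describe().round(2).to_string())

    # Skewness of each column before and after the log transform.
    # The printed labels stay in Chinese ("偏度" means "skewness") so that
    # the output matches the expected output exactly.
    print(f"annualIncome偏度: {df['annualIncome'].skew():.4f}")
    print(f"log_income偏度: {df['log_income'].skew():.4f}")
    print(f"revolBal偏度: {df['revolBal'].skew():.4f}")
    print(f"log_revolBal偏度: {df['log_revolBal'].skew():.4f}")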
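For readers wondering what number `skew()` actually reports: to the best of my knowledge, pandas returns the bias-adjusted sample skewness (the adjusted Fisher-Pearson coefficient), not the plain third standardized moment. The sketch below reimplements that formula in pure Python. The function name `adjusted_skew` and the two small lists are made up for illustration and are not part of the exercise.

```python
import math


def adjusted_skew(values):
    """Bias-adjusted sample skewness (adjusted Fisher-Pearson coefficient)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least 3 observations")
    mean = sum(values) / n
    # Second and third central moments, both divided by n.
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    if m2 == 0:
        # Constant data has no spread; returning 0.0 is an illustrative choice.
        return 0.0
    g1 = m3 / m2 ** 1.5  # plain (biased) skewness
    # Small-sample correction factor applied on top of g1.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1


right_tailed = [1, 2, 3, 4, 10]  # one large value pulls the tail to the right
symmetric = [1, 2, 3, 4, 5]      # perfectly symmetric around the mean

print(adjusted_skew(right_tailed))
print(adjusted_skew(symmetric))
```

A positive result indicates a longer right tail, a negative result a longer left tail, and a value near zero a roughly symmetric distribution.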

Example

Input
solve()
Expected output
count    10000.00
mean        11.09
std          0.54
min          0.00
25%         10.74
50%         11.08
75%         11.43
max         12.43
annualIncome偏度: 1.6698
log_income偏度: -0.8717
revolBal偏度: 9.9630
log_revolBal偏度: -2.6654
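In the output above, both raw columns are right-skewed, and both log columns come out left-skewed. One possible explanation, suggested by the `min` of 0.00 for `log_income`, is that the columns were built with something like `log1p`, which maps a raw value of 0 to 0 and so leaves a few points far below the bulk of the data. The exercise does not say how the log columns were created, so this is an assumption. The sketch below reproduces the effect on synthetic data; the variable names and distribution parameters are invented for illustration.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)

# Synthetic right-skewed "income" values plus a handful of zeros.
# These numbers are made up; they are not the exercise data.
income = np.concatenate([
    rng.lognormal(mean=11.0, sigma=0.8, size=10_000),
    np.zeros(5),
])
raw = pd.Series(income, name="income")

# log1p(x) = log(1 + x), which is defined at x = 0 and maps it to 0.
logged = np.log1p(raw)

raw_skew = raw.skew()
log_skew = logged.skew()

print(f"raw skewness:   {raw_skew:.4f}")
print(f"log1p skewness: {log_skew:.4f}")
print(f"log1p minimum:  {logged.min():.2f}")
```

In this sketch the raw series is strongly right-skewed, while the transformed series ends up mildly left-skewed because the few zeros sit far below the rest of the log values. Whether to drop, clip, or keep such rows is a modeling decision that the exercise leaves open.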