← 返回题库
初级

数据划分-训练测试集

未完成
初级参考 完整示例代码供参考,建议自己理解后重新输入
def solve():
    import pandas as pd
    from sklearn.model_selection import train_test_split
    df = pd.read_csv('https://liangdaima.com/static/data/iris.csv')
    X = df.drop('species', axis=1)
    y = df['species']
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
    print('训练集: X=', X_train.shape, 'y=', y_train.shape)
    print('测试集: X=', X_test.shape, 'y=', y_test.shape)

示例

输入
solve()
期望输出
训练集: X= (120, 4) y= (120,)
测试集: X= (30, 4) y= (30,)
Python 代码 🔒 登录后使用
🔒

登录后即可练习

注册免费账号,在浏览器中直接运行 Python 代码