{ "cells": [ { "cell_type": "markdown", "id": "8a4a74dc-5958-4c8c-abdb-784135994d35", "metadata": {}, "source": [ "\"Header\"" ] }, { "cell_type": "markdown", "id": "571b7f4d-d7d6-4a80-8288-b6eb5d76901c", "metadata": {}, "source": [ "## Assessment: Enhancing Data Science Outcomes With Efficient Workflow ##\n", "In this notebook, you will apply what you've learned in this workshop to complete an assessment. The assessment is divided into five steps to guide your development. You will be graded on the performance of your classification model. Note that this coding portion does not give partial credit: it is scored as either 0 or 60 points. \n", "
| Step | Points |
|------|--------|
| 0. Setting Up | |
| 1. Data Loading | |
| 2. Feature Engineering | |
| 3. Model Development | |
| 4. Model Persistence | 60 |
" ] }, { "cell_type": "markdown", "id": "559c1a85-e36c-4396-93fb-524048c66684", "metadata": {}, "source": [ "
" ] }, { "cell_type": "markdown", "id": "43e4be02-6163-4282-ad82-892dee6d17a0", "metadata": {}, "source": [ "### Step 0: Setting Up ###\n", "For the assessment, you will create a classification model on a dataset similar to the one used in the workshop. You'll need to use distributed computing with a Dask cluster of GPU workers. Your task is to prepare the data and train a classifier that accurately predicts a binary outcome. \n", "\n", "**Instructions**:
\n", "0.1 Modify only the placeholder in the cell below and execute it to create a CUDA cluster.
\n", "0.2 Modify only the placeholder in the cell below and execute it to instantiate a Dask client that connects to the CUDA cluster.
\n", "0.3 Execute the cell below to import the other dependencies. " ] }, { "cell_type": "code", "execution_count": 1, "id": "7c1a3928-8802-4df6-a384-a1e29f78e48c", "metadata": { "scrolled": true, "tags": [] }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "2025-11-08 14:31:46,707 - distributed.preloading - INFO - Creating preload: dask_cuda.initialize\n", "2025-11-08 14:31:46,707 - distributed.preloading - INFO - Import preload module: dask_cuda.initialize\n", "2025-11-08 14:31:46,748 - distributed.preloading - INFO - Creating preload: dask_cuda.initialize\n", "2025-11-08 14:31:46,748 - distributed.preloading - INFO - Import preload module: dask_cuda.initialize\n", "2025-11-08 14:31:46,753 - distributed.preloading - INFO - Creating preload: dask_cuda.initialize\n", "2025-11-08 14:31:46,753 - distributed.preloading - INFO - Import preload module: dask_cuda.initialize\n", "2025-11-08 14:31:46,777 - distributed.preloading - INFO - Creating preload: dask_cuda.initialize\n", "2025-11-08 14:31:46,777 - distributed.preloading - INFO - Import preload module: dask_cuda.initialize\n" ] } ], "source": [ "# 0.1\n", "# import dependencies\n", "from dask_cuda import LocalCUDACluster\n", "\n", "# instantiate a local CUDA cluster\n", "cluster=LocalCUDACluster()" ] }, { "cell_type": "code", "execution_count": 2, "id": "ed7b74c9-2679-4aad-851d-664d6b0c68b0", "metadata": { "tags": [] }, "outputs": [], "source": [ "# 0.2\n", "# import dependencies\n", "from dask.distributed import Client, wait\n", "\n", "# instantiate a Client that connects to the cluster\n", "client=Client(cluster)" ] }, { "cell_type": "code", "execution_count": 3, "id": "0353dff8-1d06-4107-9fb6-3ffdb5c99f39", "metadata": { "tags": [] }, "outputs": [], "source": [ "# 0.3\n", "# DO NOT CHANGE THIS CELL\n", "# import dependencies\n", "from dask_ml.model_selection import train_test_split\n", "import xgboost\n", "\n", "import dask_cudf\n", "import cudf\n", "import matplotlib.pyplot as plt" ] }, { "cell_type": "markdown", "id": "ac27c9b9-b8f9-4331-b6dd-f3a1c7da2058", 
"metadata": {}, "source": [ "### Step 1: Data Preparation ###\n", "The first step is to prepare the data. \n", "\n", "**Instructions**:
\n", "1.1 Modify only the placeholder in the cell below and execute it to import data from the given parquet files.
\n", "1.2 Execute the cell below to persist the data in memory and preview the `dask_cudf.DataFrame`.
\n", "1.3 Modify only the placeholder in the cell below and execute it to check for null values.
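
To make the null check in 1.3 concrete, here is a minimal plain-Python sketch of the idea behind it: each partition counts its own missing values per column, and the partial counts are then summed. It is only a stand-in for illustration, not the `dask_cudf` API; the `partitions` data and the `null_counts` helper are hypothetical.

```python
# Plain-Python stand-in for a partitioned null check (not the dask_cudf API).
# Two hypothetical partitions; each row is a dict and None marks a missing value.
partitions = [
    [{'brand': 2, 'price': 870.27, 'target': 1},
     {'brand': None, 'price': 79.24, 'target': 1}],
    [{'brand': 8, 'price': None, 'target': 0},
     {'brand': 6, 'price': 529.31, 'target': 1}],
]


def null_counts(rows):
    # Count missing values per column within a single partition.
    counts = {}
    for row in rows:
        for column, value in row.items():
            counts[column] = counts.get(column, 0) + int(value is None)
    return counts


# Combine the per-partition counts into one total per column.
total = {}
for part in partitions:
    for column, n in null_counts(part).items():
        total[column] = total.get(column, 0) + n

print(total)
```

A column whose total is 0, such as `target` in this toy data, has no missing values. On the assessment data, the check in cell 1.3 (`ddf.isnull().sum().compute()`) reports 0 for every column.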
" ] }, { "cell_type": "code", "execution_count": 4, "id": "5967661e-849b-44ed-935e-372f9aba336a", "metadata": { "scrolled": true, "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Total of 2461697 records split across 4 partitions. \n" ] } ], "source": [ "# 1.1\n", "data_dir='data'\n", "ddf=dask_cudf.read_parquet(data_dir)\n", "\n", "print(f'Total of {len(ddf)} records split across {ddf.npartitions} partitions. ')" ] }, { "cell_type": "code", "execution_count": 5, "id": "36adc611-4dde-420f-839b-b8c2a4c729f5", "metadata": { "scrolled": true, "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Columns: Index(['brand', 'cat_0', 'cat_1', 'cat_2', 'cat_3', 'price', 'ts_hour',\n", " 'ts_minute', 'ts_weekday', 'brand_target_sum', 'brand_count',\n", " 'cat_0_target_sum', 'cat_0_count', 'cat_1_target_sum', 'cat_1_count',\n", " 'cat_2_target_sum', 'cat_2_count', 'cat_3_target_sum', 'cat_3_count',\n", " 'TE_cat_0_target', 'TE_cat_1_target', 'TE_cat_2_target',\n", " 'TE_cat_3_target', 'relative_price_product', 'relative_price_category',\n", " 'target'],\n", " dtype='object')\n" ] }, { "data": { "text/html": [ "
[DataFrame preview table: 5 rows × 26 columns]
" ], "text/plain": [ " brand cat_0 cat_1 cat_2 cat_3 price ts_hour ts_minute \\\n", "0 234 3 2 2 1 54.029999 15 14 \n", "1 2 1 1 1 1 870.270020 14 19 \n", "2 2 6 5 2 1 166.670013 4 32 \n", "3 20 2 7 4 1 295.989990 13 41 \n", "4 27 3 2 2 1 43.240002 17 43 \n", "\n", " ts_weekday brand_target_sum ... cat_2_count cat_3_target_sum \\\n", "0 6 347 ... 751352 1009432 \n", "1 3 187853 ... 898095 1009432 \n", "2 1 187853 ... 751352 1009432 \n", "3 2 5828 ... 84166 1009432 \n", "4 1 2815 ... 751352 1009432 \n", "\n", " cat_3_count TE_cat_0_target TE_cat_1_target TE_cat_2_target \\\n", "0 2460405 0.298127 0.298127 0.339966 \n", "1 2460405 0.482280 0.481771 0.487952 \n", "2 2460405 0.382438 0.396645 0.339709 \n", "3 2460405 0.391197 0.416553 0.420994 \n", "4 2460405 0.298564 0.298564 0.339709 \n", "\n", " TE_cat_3_target relative_price_product relative_price_category target \n", "0 0.410503 -2.364262e-01 -0.557799 1 \n", "1 0.409943 -3.739005e-02 0.889045 1 \n", "2 0.409943 2.697176e-02 0.383894 1 \n", "3 0.410366 -4.706145e-02 -0.191993 1 \n", "4 0.409943 1.764430e-07 -0.646108 1 \n", "\n", "[5 rows x 26 columns]" ] }, "execution_count": 5, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# 1.2\n", "# DO NOT CHANGE THIS CELL\n", "# persist data\n", "ddf=ddf.persist()\n", "wait(ddf)\n", "\n", "print(f'Columns: {ddf.columns}')\n", "\n", "ddf.head()" ] }, { "cell_type": "code", "execution_count": 9, "id": "f6ace495-9b14-487c-8bf6-177907958f5e", "metadata": { "scrolled": true, "tags": [] }, "outputs": [ { "data": { "text/plain": [ "brand 0\n", "cat_0 0\n", "cat_1 0\n", "cat_2 0\n", "cat_3 0\n", "price 0\n", "ts_hour 0\n", "ts_minute 0\n", "ts_weekday 0\n", "brand_target_sum 0\n", "brand_count 0\n", "cat_0_target_sum 0\n", "cat_0_count 0\n", "cat_1_target_sum 0\n", "cat_1_count 0\n", "cat_2_target_sum 0\n", "cat_2_count 0\n", "cat_3_target_sum 0\n", "cat_3_count 0\n", "TE_cat_0_target 0\n", "TE_cat_1_target 0\n", "TE_cat_2_target 0\n", "TE_cat_3_target 0\n", 
"relative_price_product 0\n", "relative_price_category 0\n", "target 0\n", "dtype: int64" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# 1.3\n", "ddf.isnull().sum().compute()" ] }, { "cell_type": "markdown", "id": "c569654d-7fe0-4af4-b8ea-d2326bb684a2", "metadata": {}, "source": [ "### Step 2: Feature Engineering ###\n", "Next, we will perform some feature engineering. \n", "\n", "**Instructions**:
\n", "2.1 Modify only the placeholders in the cell below and execute it to `target_encode` the `brand` feature.
\n", "2.2 Modify only the placeholders in the cell below and execute it to create a `relative_price_brand` feature. Because a price may be 0 or very small, introduce an `epsilon` to avoid division by zero.
\n", "2.3 Execute the cell below to persist the data in memory. " ] }, { "cell_type": "code", "execution_count": 10, "id": "a350a853-d613-474b-8ed5-d99d271fe51b", "metadata": { "scrolled": true, "tags": [] }, "outputs": [ { "data": { "text/html": [ "
[DataFrame preview table: 5 rows × 27 columns]
" ], "text/plain": [ " brand cat_0 cat_1 cat_2 cat_3 price ts_hour ts_minute \\\n", "0 234 3 2 2 1 54.029999 15 14 \n", "1 2 1 1 1 1 870.270020 14 19 \n", "2 2 6 5 2 1 166.670013 4 32 \n", "3 20 2 7 4 1 295.989990 13 41 \n", "4 27 3 2 2 1 43.240002 17 43 \n", "\n", " ts_weekday brand_target_sum ... cat_3_target_sum cat_3_count \\\n", "0 6 347 ... 1009432 2460405 \n", "1 3 187853 ... 1009432 2460405 \n", "2 1 187853 ... 1009432 2460405 \n", "3 2 5828 ... 1009432 2460405 \n", "4 1 2815 ... 1009432 2460405 \n", "\n", " TE_cat_0_target TE_cat_1_target TE_cat_2_target TE_cat_3_target \\\n", "0 0.298127 0.298127 0.339966 0.410503 \n", "1 0.482280 0.481771 0.487952 0.409943 \n", "2 0.382438 0.396645 0.339709 0.409943 \n", "3 0.391197 0.416553 0.420994 0.410366 \n", "4 0.298564 0.298564 0.339709 0.409943 \n", "\n", " relative_price_product relative_price_category target brand_TE \n", "0 -2.364262e-01 -0.557799 1 0.475995 \n", "1 -3.739005e-02 0.889045 1 0.481383 \n", "2 2.697176e-02 0.383894 1 0.481383 \n", "3 -4.706145e-02 -0.191993 1 0.463939 \n", "4 1.764430e-07 -0.646108 1 0.247647 \n", "\n", "[5 rows x 27 columns]" ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# 2.1\n", "def target_encoding(df, cat): \n", " te_df=df.groupby(cat)['target'].mean().reset_index()\n", " te_df.columns=[cat, cat+'_TE']\n", " df=df.merge(te_df, on=cat)\n", " return df\n", "\n", "ddf=target_encoding(ddf, 'brand')\n", "ddf.head()" ] }, { "cell_type": "code", "execution_count": 16, "id": "f9c69922-bd7b-47ad-afb0-d475072cbbca", "metadata": { "scrolled": true, "tags": [] }, "outputs": [ { "data": { "text/html": [ "
[DataFrame preview table: 5 rows × 28 columns]
" ], "text/plain": [ " brand cat_0 cat_1 cat_2 cat_3 price ts_hour ts_minute \\\n", "0 2 1 1 1 1 870.270020 15 15 \n", "1 4 6 5 2 1 79.239998 4 3 \n", "2 8 2 3 7 1 385.849976 8 15 \n", "3 6 2 3 14 1 529.309998 6 58 \n", "4 6 8 11 8 1 210.820007 7 41 \n", "\n", " ts_weekday brand_target_sum ... cat_3_count TE_cat_0_target \\\n", "0 3 187853 ... 2460405 0.482280 \n", "1 1 68194 ... 2460405 0.385491 \n", "2 2 12586 ... 2460405 0.391505 \n", "3 6 17987 ... 2460405 0.392793 \n", "4 3 17987 ... 2460405 0.404443 \n", "\n", " TE_cat_1_target TE_cat_2_target TE_cat_3_target relative_price_product \\\n", "0 0.481771 0.487952 0.409943 -3.739005e-02 \n", "1 0.400192 0.340453 0.410366 1.565067e-01 \n", "2 0.367103 0.403383 0.409943 -3.966478e-03 \n", "3 0.368615 0.508762 0.410503 -8.071749e-07 \n", "4 0.459685 0.473150 0.410366 -2.895130e-07 \n", "\n", " relative_price_category target brand_TE relative_price_brand \n", "0 0.889045 1 0.481383 1.177483 \n", "1 -0.342055 1 0.381477 0.453809 \n", "2 0.285428 1 0.399670 0.850307 \n", "3 0.405433 1 0.501617 1.956347 \n", "4 0.143740 1 0.501617 0.779197 \n", "\n", "[5 rows x 28 columns]" ] }, "execution_count": 16, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# 2.2\n", "def relative_price(df, cat): \n", " epsilon=1e-5\n", " avg_price_df=df.groupby(cat)['price'].mean().reset_index()\n", " avg_price_df.columns=[cat, 'avg_price_'+cat]\n", " df=df.merge(avg_price_df, on=cat)\n", " df['relative_price_'+cat]=df['price']/(df['avg_price_'+cat] + epsilon)\n", " df=df.drop(columns=['avg_price_'+cat])\n", " return df\n", "\n", "ddf=relative_price(ddf, 'brand')\n", "ddf.head()" ] }, { "cell_type": "code", "execution_count": 17, "id": "baee112d-90bb-49ae-8b31-2e688e05f2ae", "metadata": { "scrolled": true, "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Columns: Index(['brand', 'cat_0', 'cat_1', 'cat_2', 'cat_3', 'price', 'ts_hour',\n", " 'ts_minute', 'ts_weekday', 'brand_target_sum', 
'brand_count',\n", " 'cat_0_target_sum', 'cat_0_count', 'cat_1_target_sum', 'cat_1_count',\n", " 'cat_2_target_sum', 'cat_2_count', 'cat_3_target_sum', 'cat_3_count',\n", " 'TE_cat_0_target', 'TE_cat_1_target', 'TE_cat_2_target',\n", " 'TE_cat_3_target', 'relative_price_product', 'relative_price_category',\n", " 'target', 'brand_TE', 'relative_price_brand'],\n", " dtype='object')\n" ] } ], "source": [ "# 2.3\n", "# DO NOT CHANGE THIS CELL\n", "# persist data\n", "ddf=ddf.persist()\n", "wait(ddf)\n", "\n", "print(f'Columns: {ddf.columns}')" ] }, { "cell_type": "markdown", "id": "455daecb-fbd1-4e1e-a831-68b4672f10ed", "metadata": {}, "source": [ "### Step 3: Model Training ###\n", "The next step is to train an XGBoost classifier with `xgboost.dask`. \n", "\n", "**Instructions**:
\n", "3.1 Execute the cell below to select the desired features for training and set them as `X`.
\n", "3.2 Modify only the placeholder in the cell below and execute it to set `target` as `y`.
\n", "3.3 Modify only the placeholder in the cell below and execute it to split the dataset into `X_train`, `X_test`, `y_train`, and `y_test`.
\n", "3.4 Execute the cell below to create `xgboost.dask.DaskDMatrix` objects for training and testing as `dtrain` and `dtest`.
\n", "3.5 Modify only the placeholders in the cell below and execute it to set the XGBoost parameters. For this assessment, please use the `auc` evaluation metric.
\n", "3.6 Modify only the placeholders in the cell below and execute it to initiate training.
\n", "3.7 Execute the cell below to view the training and validation history. " ] }, { "cell_type": "code", "execution_count": 18, "id": "89e51eeb-b3d2-4e19-928d-cab0c57a5e2a", "metadata": { "tags": [] }, "outputs": [], "source": [ "# 3.1\n", "# DO NOT CHANGE THIS CELL\n", "X=ddf.drop(columns=['target']).astype('float32')" ] }, { "cell_type": "code", "execution_count": 19, "id": "fcf59460-0837-40db-9f08-328754ea9b8d", "metadata": { "tags": [] }, "outputs": [], "source": [ "# 3.2\n", "y=ddf['target'].astype('float32')" ] }, { "cell_type": "code", "execution_count": 20, "id": "1ff5840d-45e3-40cf-86af-82defd2b5842", "metadata": { "scrolled": true, "tags": [] }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "/opt/conda/envs/rapids/lib/python3.9/site-packages/dask_ml/model_selection/_split.py:462: FutureWarning: The default value for 'shuffle' must be specified when splitting DataFrames. In the future DataFrames will automatically be shuffled within blocks prior to splitting. 
Specify 'shuffle=True' to adopt the future behavior now, or 'shuffle=False' to retain the previous behavior.\n", " warnings.warn(\n" ] }, { "data": { "text/plain": [ "DoneAndNotDoneFutures(done={, , , , , , , , , , , , , , , }, not_done=set())" ] }, "execution_count": 20, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# 3.3\n", "X_train, X_test, y_train, y_test=train_test_split(X, y, random_state=42)\n", "X_train, X_test, y_train, y_test=client.persist([X_train, X_test, y_train, y_test])\n", "wait([X_train, X_test, y_train, y_test])" ] }, { "cell_type": "code", "execution_count": 21, "id": "5446ffdd-2471-4577-ad78-0ac27dd3d9c2", "metadata": { "tags": [] }, "outputs": [], "source": [ "# 3.4\n", "# DO NOT CHANGE THIS CELL\n", "dtrain=xgboost.dask.DaskDMatrix(client, X_train, y_train)\n", "dtest=xgboost.dask.DaskDMatrix(client, X_test, y_test)" ] }, { "cell_type": "code", "execution_count": 23, "id": "184416f6-2b5a-41f4-a2eb-be6c888534e5", "metadata": { "scrolled": true, "tags": [] }, "outputs": [], "source": [ "# 3.5\n", "xgb_params={ \n", " 'eval_metric': ['auc'], \n", " 'objective': 'binary:logistic',\n", " 'tree_method': 'gpu_hist'\n", "}" ] }, { "cell_type": "code", "execution_count": 25, "id": "121437f1-4ec9-4056-90c2-595c8a116345", "metadata": { "scrolled": true, "tags": [] }, "outputs": [ { "name": "stderr", "output_type": "stream", "text": [ "INFO:distributed.worker:Run out-of-band function '_start_tracker'\n", "[14:46:37] task [xgboost.dask-0]:tcp://127.0.0.1:37937 got new rank 0\n", "[14:46:37] task [xgboost.dask-1]:tcp://127.0.0.1:42015 got new rank 1\n", "[14:46:37] task [xgboost.dask-2]:tcp://127.0.0.1:40413 got new rank 2\n", "[14:46:37] task [xgboost.dask-3]:tcp://127.0.0.1:37709 got new rank 3\n" ] }, { "name": "stdout", "output_type": "stream", "text": [ "[0]\ttrain-auc:0.60997\tvalid-auc:0.61033\n", "[1]\ttrain-auc:0.61210\tvalid-auc:0.61210\n", "[2]\ttrain-auc:0.61293\tvalid-auc:0.61274\n", 
"[3]\ttrain-auc:0.61370\tvalid-auc:0.61353\n", "[4]\ttrain-auc:0.61431\tvalid-auc:0.61426\n", "[5]\ttrain-auc:0.61492\tvalid-auc:0.61477\n", "[6]\ttrain-auc:0.61526\tvalid-auc:0.61501\n", "[7]\ttrain-auc:0.61586\tvalid-auc:0.61549\n", "[8]\ttrain-auc:0.61632\tvalid-auc:0.61602\n", "[9]\ttrain-auc:0.61704\tvalid-auc:0.61668\n", "[10]\ttrain-auc:0.61768\tvalid-auc:0.61723\n", "[11]\ttrain-auc:0.61799\tvalid-auc:0.61753\n", "[12]\ttrain-auc:0.61826\tvalid-auc:0.61772\n", "[13]\ttrain-auc:0.61868\tvalid-auc:0.61800\n", "[14]\ttrain-auc:0.61913\tvalid-auc:0.61847\n", "[15]\ttrain-auc:0.61984\tvalid-auc:0.61916\n", "[16]\ttrain-auc:0.61993\tvalid-auc:0.61923\n", "[17]\ttrain-auc:0.62045\tvalid-auc:0.61957\n", "[18]\ttrain-auc:0.62071\tvalid-auc:0.61971\n", "[19]\ttrain-auc:0.62089\tvalid-auc:0.61984\n", "[20]\ttrain-auc:0.62100\tvalid-auc:0.61990\n", "[21]\ttrain-auc:0.62122\tvalid-auc:0.62004\n", "[22]\ttrain-auc:0.62166\tvalid-auc:0.62038\n", "[23]\ttrain-auc:0.62223\tvalid-auc:0.62080\n", "[24]\ttrain-auc:0.62236\tvalid-auc:0.62089\n", "[25]\ttrain-auc:0.62260\tvalid-auc:0.62101\n", "[26]\ttrain-auc:0.62268\tvalid-auc:0.62104\n", "[27]\ttrain-auc:0.62289\tvalid-auc:0.62123\n", "[28]\ttrain-auc:0.62329\tvalid-auc:0.62147\n", "[29]\ttrain-auc:0.62376\tvalid-auc:0.62190\n", "[30]\ttrain-auc:0.62397\tvalid-auc:0.62198\n", "[31]\ttrain-auc:0.62402\tvalid-auc:0.62202\n", "[32]\ttrain-auc:0.62412\tvalid-auc:0.62204\n", "[33]\ttrain-auc:0.62451\tvalid-auc:0.62225\n", "[34]\ttrain-auc:0.62486\tvalid-auc:0.62251\n", "[35]\ttrain-auc:0.62497\tvalid-auc:0.62257\n", "[36]\ttrain-auc:0.62529\tvalid-auc:0.62270\n", "[37]\ttrain-auc:0.62554\tvalid-auc:0.62280\n", "[38]\ttrain-auc:0.62581\tvalid-auc:0.62301\n", "[39]\ttrain-auc:0.62613\tvalid-auc:0.62324\n", "[40]\ttrain-auc:0.62640\tvalid-auc:0.62347\n", "[41]\ttrain-auc:0.62664\tvalid-auc:0.62359\n", "[42]\ttrain-auc:0.62673\tvalid-auc:0.62367\n", "[43]\ttrain-auc:0.62691\tvalid-auc:0.62383\n", 
"[44]\ttrain-auc:0.62707\tvalid-auc:0.62387\n", "[45]\ttrain-auc:0.62733\tvalid-auc:0.62403\n", "[46]\ttrain-auc:0.62749\tvalid-auc:0.62405\n", "[47]\ttrain-auc:0.62769\tvalid-auc:0.62418\n", "[48]\ttrain-auc:0.62797\tvalid-auc:0.62435\n", "[49]\ttrain-auc:0.62811\tvalid-auc:0.62443\n", "[50]\ttrain-auc:0.62821\tvalid-auc:0.62453\n", "[51]\ttrain-auc:0.62845\tvalid-auc:0.62465\n", "[52]\ttrain-auc:0.62859\tvalid-auc:0.62466\n", "[53]\ttrain-auc:0.62874\tvalid-auc:0.62475\n", "[54]\ttrain-auc:0.62881\tvalid-auc:0.62478\n", "[55]\ttrain-auc:0.62896\tvalid-auc:0.62487\n", "[56]\ttrain-auc:0.62920\tvalid-auc:0.62496\n", "[57]\ttrain-auc:0.62943\tvalid-auc:0.62502\n", "[58]\ttrain-auc:0.62969\tvalid-auc:0.62515\n", "[59]\ttrain-auc:0.62983\tvalid-auc:0.62519\n", "[60]\ttrain-auc:0.63000\tvalid-auc:0.62523\n", "[61]\ttrain-auc:0.63025\tvalid-auc:0.62538\n", "[62]\ttrain-auc:0.63037\tvalid-auc:0.62541\n", "[63]\ttrain-auc:0.63047\tvalid-auc:0.62541\n", "[64]\ttrain-auc:0.63060\tvalid-auc:0.62547\n", "[65]\ttrain-auc:0.63081\tvalid-auc:0.62560\n", "[66]\ttrain-auc:0.63101\tvalid-auc:0.62567\n", "[67]\ttrain-auc:0.63124\tvalid-auc:0.62580\n", "[68]\ttrain-auc:0.63146\tvalid-auc:0.62590\n", "[69]\ttrain-auc:0.63157\tvalid-auc:0.62595\n", "[70]\ttrain-auc:0.63178\tvalid-auc:0.62598\n", "[71]\ttrain-auc:0.63189\tvalid-auc:0.62601\n", "[72]\ttrain-auc:0.63203\tvalid-auc:0.62606\n", "[73]\ttrain-auc:0.63223\tvalid-auc:0.62621\n", "[74]\ttrain-auc:0.63237\tvalid-auc:0.62624\n", "[75]\ttrain-auc:0.63254\tvalid-auc:0.62633\n", "[76]\ttrain-auc:0.63265\tvalid-auc:0.62635\n", "[77]\ttrain-auc:0.63268\tvalid-auc:0.62634\n", "[78]\ttrain-auc:0.63273\tvalid-auc:0.62631\n", "[79]\ttrain-auc:0.63282\tvalid-auc:0.62636\n", "[80]\ttrain-auc:0.63301\tvalid-auc:0.62647\n", "[81]\ttrain-auc:0.63324\tvalid-auc:0.62661\n", "[82]\ttrain-auc:0.63341\tvalid-auc:0.62668\n", "[83]\ttrain-auc:0.63354\tvalid-auc:0.62669\n", "[84]\ttrain-auc:0.63370\tvalid-auc:0.62677\n", 
"[85]\ttrain-auc:0.63377\tvalid-auc:0.62679\n", "[86]\ttrain-auc:0.63381\tvalid-auc:0.62679\n", "[87]\ttrain-auc:0.63399\tvalid-auc:0.62692\n", "[88]\ttrain-auc:0.63413\tvalid-auc:0.62696\n", "[89]\ttrain-auc:0.63431\tvalid-auc:0.62704\n", "[90]\ttrain-auc:0.63447\tvalid-auc:0.62717\n", "[91]\ttrain-auc:0.63471\tvalid-auc:0.62730\n", "[92]\ttrain-auc:0.63489\tvalid-auc:0.62737\n", "[93]\ttrain-auc:0.63502\tvalid-auc:0.62742\n", "[94]\ttrain-auc:0.63515\tvalid-auc:0.62746\n", "[95]\ttrain-auc:0.63526\tvalid-auc:0.62758\n", "[96]\ttrain-auc:0.63538\tvalid-auc:0.62762\n", "[97]\ttrain-auc:0.63544\tvalid-auc:0.62764\n", "[98]\ttrain-auc:0.63563\tvalid-auc:0.62774\n", "[99]\ttrain-auc:0.63573\tvalid-auc:0.62780\n", "[100]\ttrain-auc:0.63588\tvalid-auc:0.62787\n", "[101]\ttrain-auc:0.63598\tvalid-auc:0.62788\n", "[102]\ttrain-auc:0.63613\tvalid-auc:0.62794\n", "[103]\ttrain-auc:0.63632\tvalid-auc:0.62802\n", "[104]\ttrain-auc:0.63647\tvalid-auc:0.62809\n", "[105]\ttrain-auc:0.63660\tvalid-auc:0.62815\n", "[106]\ttrain-auc:0.63673\tvalid-auc:0.62820\n", "[107]\ttrain-auc:0.63683\tvalid-auc:0.62820\n", "[108]\ttrain-auc:0.63694\tvalid-auc:0.62822\n", "[109]\ttrain-auc:0.63704\tvalid-auc:0.62825\n", "[110]\ttrain-auc:0.63710\tvalid-auc:0.62826\n", "[111]\ttrain-auc:0.63721\tvalid-auc:0.62829\n", "[112]\ttrain-auc:0.63737\tvalid-auc:0.62837\n", "[113]\ttrain-auc:0.63751\tvalid-auc:0.62845\n", "[114]\ttrain-auc:0.63765\tvalid-auc:0.62849\n", "[115]\ttrain-auc:0.63781\tvalid-auc:0.62857\n", "[116]\ttrain-auc:0.63791\tvalid-auc:0.62863\n", "[117]\ttrain-auc:0.63797\tvalid-auc:0.62864\n", "[118]\ttrain-auc:0.63807\tvalid-auc:0.62868\n", "[119]\ttrain-auc:0.63819\tvalid-auc:0.62872\n", "[120]\ttrain-auc:0.63825\tvalid-auc:0.62871\n", "[121]\ttrain-auc:0.63832\tvalid-auc:0.62874\n", "[122]\ttrain-auc:0.63846\tvalid-auc:0.62881\n", "[123]\ttrain-auc:0.63857\tvalid-auc:0.62887\n", "[124]\ttrain-auc:0.63871\tvalid-auc:0.62891\n", "[125]\ttrain-auc:0.63890\tvalid-auc:0.62900\n", 
"[126]\ttrain-auc:0.63899\tvalid-auc:0.62901\n", "[127]\ttrain-auc:0.63912\tvalid-auc:0.62905\n", "[128]\ttrain-auc:0.63921\tvalid-auc:0.62909\n", "[129]\ttrain-auc:0.63931\tvalid-auc:0.62910\n", "[130]\ttrain-auc:0.63935\tvalid-auc:0.62907\n", "[131]\ttrain-auc:0.63950\tvalid-auc:0.62914\n", "[132]\ttrain-auc:0.63964\tvalid-auc:0.62918\n", "[133]\ttrain-auc:0.63975\tvalid-auc:0.62922\n", "[134]\ttrain-auc:0.63988\tvalid-auc:0.62929\n", "[135]\ttrain-auc:0.63994\tvalid-auc:0.62931\n", "[136]\ttrain-auc:0.64006\tvalid-auc:0.62936\n", "[137]\ttrain-auc:0.64022\tvalid-auc:0.62944\n", "[138]\ttrain-auc:0.64034\tvalid-auc:0.62953\n", "[139]\ttrain-auc:0.64048\tvalid-auc:0.62958\n", "[140]\ttrain-auc:0.64065\tvalid-auc:0.62967\n", "[141]\ttrain-auc:0.64081\tvalid-auc:0.62963\n", "[142]\ttrain-auc:0.64092\tvalid-auc:0.62967\n", "[143]\ttrain-auc:0.64102\tvalid-auc:0.62970\n", "[144]\ttrain-auc:0.64108\tvalid-auc:0.62973\n", "[145]\ttrain-auc:0.64121\tvalid-auc:0.62982\n", "[146]\ttrain-auc:0.64126\tvalid-auc:0.62979\n", "[147]\ttrain-auc:0.64134\tvalid-auc:0.62983\n", "[148]\ttrain-auc:0.64146\tvalid-auc:0.62987\n", "[149]\ttrain-auc:0.64158\tvalid-auc:0.62994\n", "[150]\ttrain-auc:0.64173\tvalid-auc:0.62998\n", "[151]\ttrain-auc:0.64177\tvalid-auc:0.63001\n", "[152]\ttrain-auc:0.64186\tvalid-auc:0.63000\n", "[153]\ttrain-auc:0.64194\tvalid-auc:0.63001\n", "[154]\ttrain-auc:0.64204\tvalid-auc:0.63003\n", "[155]\ttrain-auc:0.64216\tvalid-auc:0.63009\n", "[156]\ttrain-auc:0.64233\tvalid-auc:0.63016\n", "[157]\ttrain-auc:0.64248\tvalid-auc:0.63024\n", "[158]\ttrain-auc:0.64259\tvalid-auc:0.63026\n", "[159]\ttrain-auc:0.64270\tvalid-auc:0.63033\n", "[160]\ttrain-auc:0.64282\tvalid-auc:0.63035\n", "[161]\ttrain-auc:0.64290\tvalid-auc:0.63038\n", "[162]\ttrain-auc:0.64298\tvalid-auc:0.63042\n", "[163]\ttrain-auc:0.64312\tvalid-auc:0.63045\n", "[164]\ttrain-auc:0.64324\tvalid-auc:0.63052\n", "[165]\ttrain-auc:0.64334\tvalid-auc:0.63051\n", 
"[166]\ttrain-auc:0.64342\tvalid-auc:0.63055\n", "[167]\ttrain-auc:0.64348\tvalid-auc:0.63056\n", "[168]\ttrain-auc:0.64355\tvalid-auc:0.63058\n", "[169]\ttrain-auc:0.64358\tvalid-auc:0.63058\n", "[170]\ttrain-auc:0.64368\tvalid-auc:0.63060\n", "[171]\ttrain-auc:0.64381\tvalid-auc:0.63068\n", "[172]\ttrain-auc:0.64391\tvalid-auc:0.63069\n", "[173]\ttrain-auc:0.64401\tvalid-auc:0.63070\n", "[174]\ttrain-auc:0.64408\tvalid-auc:0.63072\n", "[175]\ttrain-auc:0.64420\tvalid-auc:0.63076\n", "[176]\ttrain-auc:0.64433\tvalid-auc:0.63078\n", "[177]\ttrain-auc:0.64447\tvalid-auc:0.63080\n", "[178]\ttrain-auc:0.64455\tvalid-auc:0.63082\n", "[179]\ttrain-auc:0.64468\tvalid-auc:0.63089\n", "[180]\ttrain-auc:0.64478\tvalid-auc:0.63090\n", "[181]\ttrain-auc:0.64486\tvalid-auc:0.63091\n", "[182]\ttrain-auc:0.64500\tvalid-auc:0.63092\n", "[183]\ttrain-auc:0.64509\tvalid-auc:0.63093\n", "[184]\ttrain-auc:0.64513\tvalid-auc:0.63095\n", "[185]\ttrain-auc:0.64522\tvalid-auc:0.63100\n", "[186]\ttrain-auc:0.64536\tvalid-auc:0.63105\n", "[187]\ttrain-auc:0.64549\tvalid-auc:0.63106\n", "[188]\ttrain-auc:0.64561\tvalid-auc:0.63111\n", "[189]\ttrain-auc:0.64569\tvalid-auc:0.63113\n", "[190]\ttrain-auc:0.64573\tvalid-auc:0.63113\n", "[191]\ttrain-auc:0.64578\tvalid-auc:0.63112\n", "[192]\ttrain-auc:0.64591\tvalid-auc:0.63115\n", "[193]\ttrain-auc:0.64607\tvalid-auc:0.63119\n", "[194]\ttrain-auc:0.64613\tvalid-auc:0.63120\n", "[195]\ttrain-auc:0.64624\tvalid-auc:0.63127\n", "[196]\ttrain-auc:0.64633\tvalid-auc:0.63127\n", "[197]\ttrain-auc:0.64640\tvalid-auc:0.63129\n", "[198]\ttrain-auc:0.64648\tvalid-auc:0.63132\n", "[199]\ttrain-auc:0.64654\tvalid-auc:0.63130\n", "[200]\ttrain-auc:0.64658\tvalid-auc:0.63130\n", "[201]\ttrain-auc:0.64669\tvalid-auc:0.63134\n", "[202]\ttrain-auc:0.64676\tvalid-auc:0.63133\n", "[203]\ttrain-auc:0.64686\tvalid-auc:0.63137\n", "[204]\ttrain-auc:0.64691\tvalid-auc:0.63135\n", "[205]\ttrain-auc:0.64696\tvalid-auc:0.63135\n", 
"[206]\ttrain-auc:0.64698\tvalid-auc:0.63136\n", "[207]\ttrain-auc:0.64702\tvalid-auc:0.63136\n", "[208]\ttrain-auc:0.64712\tvalid-auc:0.63142\n", "[209]\ttrain-auc:0.64720\tvalid-auc:0.63143\n", "[210]\ttrain-auc:0.64733\tvalid-auc:0.63153\n", "[211]\ttrain-auc:0.64743\tvalid-auc:0.63157\n", "[212]\ttrain-auc:0.64753\tvalid-auc:0.63158\n", "[213]\ttrain-auc:0.64767\tvalid-auc:0.63166\n", "[214]\ttrain-auc:0.64773\tvalid-auc:0.63165\n", "[215]\ttrain-auc:0.64778\tvalid-auc:0.63166\n", "[216]\ttrain-auc:0.64786\tvalid-auc:0.63167\n", "[217]\ttrain-auc:0.64795\tvalid-auc:0.63166\n", "[218]\ttrain-auc:0.64806\tvalid-auc:0.63169\n", "[219]\ttrain-auc:0.64815\tvalid-auc:0.63174\n", "[220]\ttrain-auc:0.64823\tvalid-auc:0.63176\n", "[221]\ttrain-auc:0.64834\tvalid-auc:0.63183\n", "[222]\ttrain-auc:0.64845\tvalid-auc:0.63184\n", "[223]\ttrain-auc:0.64854\tvalid-auc:0.63186\n", "[224]\ttrain-auc:0.64863\tvalid-auc:0.63185\n", "[225]\ttrain-auc:0.64873\tvalid-auc:0.63187\n", "[226]\ttrain-auc:0.64878\tvalid-auc:0.63187\n", "[227]\ttrain-auc:0.64891\tvalid-auc:0.63190\n", "[228]\ttrain-auc:0.64901\tvalid-auc:0.63194\n", "[229]\ttrain-auc:0.64912\tvalid-auc:0.63196\n", "[230]\ttrain-auc:0.64923\tvalid-auc:0.63198\n", "[231]\ttrain-auc:0.64936\tvalid-auc:0.63201\n", "[232]\ttrain-auc:0.64950\tvalid-auc:0.63206\n", "[233]\ttrain-auc:0.64958\tvalid-auc:0.63209\n", "[234]\ttrain-auc:0.64970\tvalid-auc:0.63219\n", "[235]\ttrain-auc:0.64980\tvalid-auc:0.63225\n", "[236]\ttrain-auc:0.64989\tvalid-auc:0.63227\n", "[237]\ttrain-auc:0.64998\tvalid-auc:0.63229\n", "[238]\ttrain-auc:0.65003\tvalid-auc:0.63229\n", "[239]\ttrain-auc:0.65013\tvalid-auc:0.63232\n", "[240]\ttrain-auc:0.65017\tvalid-auc:0.63232\n", "[241]\ttrain-auc:0.65027\tvalid-auc:0.63233\n", "[242]\ttrain-auc:0.65037\tvalid-auc:0.63233\n", "[243]\ttrain-auc:0.65044\tvalid-auc:0.63233\n", "[244]\ttrain-auc:0.65049\tvalid-auc:0.63235\n", "[245]\ttrain-auc:0.65058\tvalid-auc:0.63237\n", 
"[246]\ttrain-auc:0.65066\tvalid-auc:0.63238\n", "[247]\ttrain-auc:0.65074\tvalid-auc:0.63237\n", "[248]\ttrain-auc:0.65081\tvalid-auc:0.63238\n", "[249]\ttrain-auc:0.65088\tvalid-auc:0.63241\n", "[250]\ttrain-auc:0.65094\tvalid-auc:0.63243\n", "[251]\ttrain-auc:0.65100\tvalid-auc:0.63246\n", "[252]\ttrain-auc:0.65109\tvalid-auc:0.63247\n", "[253]\ttrain-auc:0.65123\tvalid-auc:0.63250\n", "[254]\ttrain-auc:0.65132\tvalid-auc:0.63251\n", "[255]\ttrain-auc:0.65138\tvalid-auc:0.63253\n", "[256]\ttrain-auc:0.65150\tvalid-auc:0.63256\n", "[257]\ttrain-auc:0.65155\tvalid-auc:0.63258\n", "[258]\ttrain-auc:0.65166\tvalid-auc:0.63261\n", "[259]\ttrain-auc:0.65182\tvalid-auc:0.63265\n", "[260]\ttrain-auc:0.65186\tvalid-auc:0.63265\n", "[261]\ttrain-auc:0.65198\tvalid-auc:0.63275\n", "[262]\ttrain-auc:0.65207\tvalid-auc:0.63274\n", "[263]\ttrain-auc:0.65212\tvalid-auc:0.63273\n", "[264]\ttrain-auc:0.65228\tvalid-auc:0.63286\n", "[265]\ttrain-auc:0.65239\tvalid-auc:0.63288\n", "[266]\ttrain-auc:0.65247\tvalid-auc:0.63292\n", "[267]\ttrain-auc:0.65251\tvalid-auc:0.63292\n", "[268]\ttrain-auc:0.65260\tvalid-auc:0.63294\n", "[269]\ttrain-auc:0.65265\tvalid-auc:0.63296\n", "[270]\ttrain-auc:0.65269\tvalid-auc:0.63297\n", "[271]\ttrain-auc:0.65277\tvalid-auc:0.63299\n", "[272]\ttrain-auc:0.65281\tvalid-auc:0.63300\n", "[273]\ttrain-auc:0.65288\tvalid-auc:0.63301\n", "[274]\ttrain-auc:0.65301\tvalid-auc:0.63304\n", "[275]\ttrain-auc:0.65312\tvalid-auc:0.63311\n", "[276]\ttrain-auc:0.65323\tvalid-auc:0.63314\n", "[277]\ttrain-auc:0.65335\tvalid-auc:0.63316\n", "[278]\ttrain-auc:0.65344\tvalid-auc:0.63318\n", "[279]\ttrain-auc:0.65355\tvalid-auc:0.63317\n", "[280]\ttrain-auc:0.65362\tvalid-auc:0.63319\n", "[281]\ttrain-auc:0.65370\tvalid-auc:0.63319\n", "[282]\ttrain-auc:0.65378\tvalid-auc:0.63316\n", "[283]\ttrain-auc:0.65387\tvalid-auc:0.63317\n", "[284]\ttrain-auc:0.65398\tvalid-auc:0.63320\n", "[285]\ttrain-auc:0.65405\tvalid-auc:0.63322\n", 
"[286]\ttrain-auc:0.65417\tvalid-auc:0.63326\n", "[287]\ttrain-auc:0.65423\tvalid-auc:0.63329\n", "[288]\ttrain-auc:0.65434\tvalid-auc:0.63333\n", "[289]\ttrain-auc:0.65448\tvalid-auc:0.63339\n", "[290]\ttrain-auc:0.65457\tvalid-auc:0.63342\n", "[291]\ttrain-auc:0.65468\tvalid-auc:0.63344\n", "[292]\ttrain-auc:0.65477\tvalid-auc:0.63346\n", "[293]\ttrain-auc:0.65490\tvalid-auc:0.63350\n", "[294]\ttrain-auc:0.65497\tvalid-auc:0.63352\n", "[295]\ttrain-auc:0.65509\tvalid-auc:0.63356\n", "[296]\ttrain-auc:0.65516\tvalid-auc:0.63358\n", "[297]\ttrain-auc:0.65524\tvalid-auc:0.63359\n", "[298]\ttrain-auc:0.65530\tvalid-auc:0.63363\n", "[299]\ttrain-auc:0.65538\tvalid-auc:0.63364\n", "[300]\ttrain-auc:0.65547\tvalid-auc:0.63369\n", "[301]\ttrain-auc:0.65555\tvalid-auc:0.63368\n", "[302]\ttrain-auc:0.65566\tvalid-auc:0.63375\n", "[303]\ttrain-auc:0.65575\tvalid-auc:0.63377\n", "[304]\ttrain-auc:0.65585\tvalid-auc:0.63380\n", "[305]\ttrain-auc:0.65599\tvalid-auc:0.63389\n", "[306]\ttrain-auc:0.65608\tvalid-auc:0.63392\n", "[307]\ttrain-auc:0.65620\tvalid-auc:0.63398\n", "[308]\ttrain-auc:0.65630\tvalid-auc:0.63403\n", "[309]\ttrain-auc:0.65640\tvalid-auc:0.63406\n", "[310]\ttrain-auc:0.65645\tvalid-auc:0.63406\n", "[311]\ttrain-auc:0.65656\tvalid-auc:0.63410\n", "[312]\ttrain-auc:0.65669\tvalid-auc:0.63420\n", "[313]\ttrain-auc:0.65673\tvalid-auc:0.63422\n", "[314]\ttrain-auc:0.65679\tvalid-auc:0.63424\n", "[315]\ttrain-auc:0.65684\tvalid-auc:0.63426\n", "[316]\ttrain-auc:0.65696\tvalid-auc:0.63426\n", "[317]\ttrain-auc:0.65707\tvalid-auc:0.63425\n", "[318]\ttrain-auc:0.65711\tvalid-auc:0.63425\n", "[319]\ttrain-auc:0.65721\tvalid-auc:0.63427\n", "[320]\ttrain-auc:0.65731\tvalid-auc:0.63430\n", "[321]\ttrain-auc:0.65739\tvalid-auc:0.63437\n", "[322]\ttrain-auc:0.65749\tvalid-auc:0.63440\n", "[323]\ttrain-auc:0.65758\tvalid-auc:0.63438\n", "[324]\ttrain-auc:0.65764\tvalid-auc:0.63438\n", "[325]\ttrain-auc:0.65774\tvalid-auc:0.63442\n", 
"[326]\ttrain-auc:0.65780\tvalid-auc:0.63443\n", "[327]\ttrain-auc:0.65789\tvalid-auc:0.63444\n", "[328]\ttrain-auc:0.65796\tvalid-auc:0.63444\n", "[329]\ttrain-auc:0.65811\tvalid-auc:0.63448\n", "[330]\ttrain-auc:0.65821\tvalid-auc:0.63449\n", "[331]\ttrain-auc:0.65830\tvalid-auc:0.63451\n", "[332]\ttrain-auc:0.65836\tvalid-auc:0.63457\n", "[333]\ttrain-auc:0.65842\tvalid-auc:0.63458\n", "[334]\ttrain-auc:0.65855\tvalid-auc:0.63461\n", "[335]\ttrain-auc:0.65865\tvalid-auc:0.63464\n", "[336]\ttrain-auc:0.65876\tvalid-auc:0.63468\n", "[337]\ttrain-auc:0.65883\tvalid-auc:0.63471\n", "[338]\ttrain-auc:0.65895\tvalid-auc:0.63479\n", "[339]\ttrain-auc:0.65898\tvalid-auc:0.63479\n", "[340]\ttrain-auc:0.65910\tvalid-auc:0.63484\n", "[341]\ttrain-auc:0.65918\tvalid-auc:0.63484\n", "[342]\ttrain-auc:0.65924\tvalid-auc:0.63487\n", "[343]\ttrain-auc:0.65934\tvalid-auc:0.63489\n", "[344]\ttrain-auc:0.65942\tvalid-auc:0.63492\n", "[345]\ttrain-auc:0.65951\tvalid-auc:0.63493\n", "[346]\ttrain-auc:0.65960\tvalid-auc:0.63494\n", "[347]\ttrain-auc:0.65964\tvalid-auc:0.63496\n", "[348]\ttrain-auc:0.65974\tvalid-auc:0.63496\n", "[349]\ttrain-auc:0.65981\tvalid-auc:0.63497\n", "[350]\ttrain-auc:0.65989\tvalid-auc:0.63499\n", "[351]\ttrain-auc:0.65997\tvalid-auc:0.63498\n", "[352]\ttrain-auc:0.66004\tvalid-auc:0.63499\n", "[353]\ttrain-auc:0.66008\tvalid-auc:0.63499\n", "[354]\ttrain-auc:0.66016\tvalid-auc:0.63503\n", "[355]\ttrain-auc:0.66024\tvalid-auc:0.63502\n", "[356]\ttrain-auc:0.66031\tvalid-auc:0.63503\n", "[357]\ttrain-auc:0.66040\tvalid-auc:0.63505\n", "[358]\ttrain-auc:0.66046\tvalid-auc:0.63504\n", "[359]\ttrain-auc:0.66055\tvalid-auc:0.63505\n", "[360]\ttrain-auc:0.66062\tvalid-auc:0.63510\n", "[361]\ttrain-auc:0.66068\tvalid-auc:0.63508\n", "[362]\ttrain-auc:0.66078\tvalid-auc:0.63513\n", "[363]\ttrain-auc:0.66085\tvalid-auc:0.63514\n", "[364]\ttrain-auc:0.66092\tvalid-auc:0.63520\n", "[365]\ttrain-auc:0.66096\tvalid-auc:0.63518\n", 
"[366]\ttrain-auc:0.66100\tvalid-auc:0.63517\n", "[367]\ttrain-auc:0.66110\tvalid-auc:0.63518\n", "[368]\ttrain-auc:0.66117\tvalid-auc:0.63521\n", "[369]\ttrain-auc:0.66120\tvalid-auc:0.63519\n", "[370]\ttrain-auc:0.66129\tvalid-auc:0.63521\n", "[371]\ttrain-auc:0.66133\tvalid-auc:0.63522\n", "[372]\ttrain-auc:0.66142\tvalid-auc:0.63524\n", "[373]\ttrain-auc:0.66145\tvalid-auc:0.63525\n", "[374]\ttrain-auc:0.66148\tvalid-auc:0.63525\n", "[375]\ttrain-auc:0.66152\tvalid-auc:0.63526\n", "[376]\ttrain-auc:0.66162\tvalid-auc:0.63531\n", "[377]\ttrain-auc:0.66171\tvalid-auc:0.63532\n", "[378]\ttrain-auc:0.66176\tvalid-auc:0.63530\n", "[379]\ttrain-auc:0.66182\tvalid-auc:0.63530\n", "[380]\ttrain-auc:0.66193\tvalid-auc:0.63532\n", "[381]\ttrain-auc:0.66199\tvalid-auc:0.63534\n", "[382]\ttrain-auc:0.66205\tvalid-auc:0.63536\n", "[383]\ttrain-auc:0.66210\tvalid-auc:0.63535\n", "[384]\ttrain-auc:0.66220\tvalid-auc:0.63540\n", "[385]\ttrain-auc:0.66228\tvalid-auc:0.63537\n", "[386]\ttrain-auc:0.66238\tvalid-auc:0.63539\n", "[387]\ttrain-auc:0.66245\tvalid-auc:0.63542\n", "[388]\ttrain-auc:0.66251\tvalid-auc:0.63543\n", "[389]\ttrain-auc:0.66258\tvalid-auc:0.63545\n", "[390]\ttrain-auc:0.66267\tvalid-auc:0.63549\n", "[391]\ttrain-auc:0.66273\tvalid-auc:0.63550\n", "[392]\ttrain-auc:0.66279\tvalid-auc:0.63553\n", "[393]\ttrain-auc:0.66287\tvalid-auc:0.63551\n", "[394]\ttrain-auc:0.66290\tvalid-auc:0.63552\n", "[395]\ttrain-auc:0.66295\tvalid-auc:0.63551\n", "[396]\ttrain-auc:0.66300\tvalid-auc:0.63552\n", "[397]\ttrain-auc:0.66308\tvalid-auc:0.63553\n", "[398]\ttrain-auc:0.66316\tvalid-auc:0.63551\n", "[399]\ttrain-auc:0.66320\tvalid-auc:0.63552\n", "[400]\ttrain-auc:0.66326\tvalid-auc:0.63555\n", "[401]\ttrain-auc:0.66333\tvalid-auc:0.63554\n", "[402]\ttrain-auc:0.66340\tvalid-auc:0.63558\n", "[403]\ttrain-auc:0.66349\tvalid-auc:0.63558\n", "[404]\ttrain-auc:0.66356\tvalid-auc:0.63560\n", "[405]\ttrain-auc:0.66363\tvalid-auc:0.63562\n", 
"[406]\ttrain-auc:0.66369\tvalid-auc:0.63563\n", "[407]\ttrain-auc:0.66373\tvalid-auc:0.63564\n", "[408]\ttrain-auc:0.66379\tvalid-auc:0.63568\n", "[409]\ttrain-auc:0.66388\tvalid-auc:0.63571\n", "[410]\ttrain-auc:0.66397\tvalid-auc:0.63577\n", "[411]\ttrain-auc:0.66409\tvalid-auc:0.63579\n", "[412]\ttrain-auc:0.66414\tvalid-auc:0.63580\n", "[413]\ttrain-auc:0.66425\tvalid-auc:0.63585\n", "[414]\ttrain-auc:0.66433\tvalid-auc:0.63589\n", "[415]\ttrain-auc:0.66441\tvalid-auc:0.63592\n", "[416]\ttrain-auc:0.66446\tvalid-auc:0.63595\n", "[417]\ttrain-auc:0.66454\tvalid-auc:0.63597\n", "[418]\ttrain-auc:0.66459\tvalid-auc:0.63599\n", "[419]\ttrain-auc:0.66467\tvalid-auc:0.63601\n", "[420]\ttrain-auc:0.66475\tvalid-auc:0.63603\n", "[421]\ttrain-auc:0.66482\tvalid-auc:0.63602\n", "[422]\ttrain-auc:0.66487\tvalid-auc:0.63604\n", "[423]\ttrain-auc:0.66493\tvalid-auc:0.63606\n", "[424]\ttrain-auc:0.66501\tvalid-auc:0.63607\n", "[425]\ttrain-auc:0.66509\tvalid-auc:0.63607\n", "[426]\ttrain-auc:0.66515\tvalid-auc:0.63606\n", "[427]\ttrain-auc:0.66525\tvalid-auc:0.63606\n", "[428]\ttrain-auc:0.66527\tvalid-auc:0.63606\n", "[429]\ttrain-auc:0.66531\tvalid-auc:0.63609\n", "[430]\ttrain-auc:0.66533\tvalid-auc:0.63611\n", "[431]\ttrain-auc:0.66543\tvalid-auc:0.63613\n", "[432]\ttrain-auc:0.66549\tvalid-auc:0.63617\n", "[433]\ttrain-auc:0.66559\tvalid-auc:0.63621\n", "[434]\ttrain-auc:0.66565\tvalid-auc:0.63626\n", "[435]\ttrain-auc:0.66571\tvalid-auc:0.63626\n", "[436]\ttrain-auc:0.66577\tvalid-auc:0.63628\n", "[437]\ttrain-auc:0.66586\tvalid-auc:0.63629\n", "[438]\ttrain-auc:0.66592\tvalid-auc:0.63630\n", "[439]\ttrain-auc:0.66601\tvalid-auc:0.63632\n", "[440]\ttrain-auc:0.66605\tvalid-auc:0.63633\n", "[441]\ttrain-auc:0.66610\tvalid-auc:0.63633\n", "[442]\ttrain-auc:0.66616\tvalid-auc:0.63634\n", "[443]\ttrain-auc:0.66619\tvalid-auc:0.63634\n", "[444]\ttrain-auc:0.66630\tvalid-auc:0.63641\n", "[445]\ttrain-auc:0.66637\tvalid-auc:0.63643\n", 
"[446]\ttrain-auc:0.66644\tvalid-auc:0.63644\n", "[447]\ttrain-auc:0.66649\tvalid-auc:0.63646\n", "[448]\ttrain-auc:0.66659\tvalid-auc:0.63650\n", "[449]\ttrain-auc:0.66664\tvalid-auc:0.63651\n", "[450]\ttrain-auc:0.66668\tvalid-auc:0.63650\n", "[451]\ttrain-auc:0.66671\tvalid-auc:0.63652\n", "[452]\ttrain-auc:0.66674\tvalid-auc:0.63651\n", "[453]\ttrain-auc:0.66679\tvalid-auc:0.63653\n", "[454]\ttrain-auc:0.66685\tvalid-auc:0.63657\n", "[455]\ttrain-auc:0.66692\tvalid-auc:0.63656\n", "[456]\ttrain-auc:0.66695\tvalid-auc:0.63658\n", "[457]\ttrain-auc:0.66701\tvalid-auc:0.63659\n", "[458]\ttrain-auc:0.66707\tvalid-auc:0.63661\n", "[459]\ttrain-auc:0.66714\tvalid-auc:0.63662\n", "[460]\ttrain-auc:0.66728\tvalid-auc:0.63667\n", "[461]\ttrain-auc:0.66733\tvalid-auc:0.63669\n", "[462]\ttrain-auc:0.66738\tvalid-auc:0.63669\n", "[463]\ttrain-auc:0.66744\tvalid-auc:0.63669\n", "[464]\ttrain-auc:0.66747\tvalid-auc:0.63668\n", "[465]\ttrain-auc:0.66751\tvalid-auc:0.63670\n", "[466]\ttrain-auc:0.66760\tvalid-auc:0.63672\n", "[467]\ttrain-auc:0.66768\tvalid-auc:0.63673\n", "[468]\ttrain-auc:0.66777\tvalid-auc:0.63676\n", "[469]\ttrain-auc:0.66781\tvalid-auc:0.63677\n", "[470]\ttrain-auc:0.66790\tvalid-auc:0.63678\n", "[471]\ttrain-auc:0.66795\tvalid-auc:0.63678\n", "[472]\ttrain-auc:0.66800\tvalid-auc:0.63680\n", "[473]\ttrain-auc:0.66807\tvalid-auc:0.63679\n", "[474]\ttrain-auc:0.66811\tvalid-auc:0.63679\n", "[475]\ttrain-auc:0.66818\tvalid-auc:0.63680\n", "[476]\ttrain-auc:0.66825\tvalid-auc:0.63683\n", "[477]\ttrain-auc:0.66834\tvalid-auc:0.63687\n", "[478]\ttrain-auc:0.66838\tvalid-auc:0.63687\n", "[479]\ttrain-auc:0.66846\tvalid-auc:0.63688\n", "[480]\ttrain-auc:0.66852\tvalid-auc:0.63689\n", "[481]\ttrain-auc:0.66858\tvalid-auc:0.63691\n", "[482]\ttrain-auc:0.66869\tvalid-auc:0.63691\n", "[483]\ttrain-auc:0.66872\tvalid-auc:0.63689\n", "[484]\ttrain-auc:0.66876\tvalid-auc:0.63687\n", "[485]\ttrain-auc:0.66878\tvalid-auc:0.63688\n", 
"[486]\ttrain-auc:0.66882\tvalid-auc:0.63688\n", "[487]\ttrain-auc:0.66883\tvalid-auc:0.63689\n", "[488]\ttrain-auc:0.66887\tvalid-auc:0.63691\n", "[489]\ttrain-auc:0.66895\tvalid-auc:0.63691\n", "[490]\ttrain-auc:0.66899\tvalid-auc:0.63691\n", "[491]\ttrain-auc:0.66905\tvalid-auc:0.63693\n", "[492]\ttrain-auc:0.66912\tvalid-auc:0.63691\n", "[493]\ttrain-auc:0.66919\tvalid-auc:0.63692\n", "[494]\ttrain-auc:0.66927\tvalid-auc:0.63691\n", "[495]\ttrain-auc:0.66932\tvalid-auc:0.63690\n", "[496]\ttrain-auc:0.66940\tvalid-auc:0.63693\n", "[497]\ttrain-auc:0.66946\tvalid-auc:0.63695\n", "[498]\ttrain-auc:0.66957\tvalid-auc:0.63697\n", "[499]\ttrain-auc:0.66961\tvalid-auc:0.63695\n", "[500]\ttrain-auc:0.66967\tvalid-auc:0.63698\n", "[501]\ttrain-auc:0.66977\tvalid-auc:0.63701\n", "[502]\ttrain-auc:0.66983\tvalid-auc:0.63704\n", "[503]\ttrain-auc:0.66993\tvalid-auc:0.63701\n", "[504]\ttrain-auc:0.67000\tvalid-auc:0.63705\n", "[505]\ttrain-auc:0.67003\tvalid-auc:0.63704\n", "[506]\ttrain-auc:0.67006\tvalid-auc:0.63704\n", "[507]\ttrain-auc:0.67012\tvalid-auc:0.63705\n", "[508]\ttrain-auc:0.67022\tvalid-auc:0.63709\n", "[509]\ttrain-auc:0.67032\tvalid-auc:0.63711\n", "[510]\ttrain-auc:0.67039\tvalid-auc:0.63712\n", "[511]\ttrain-auc:0.67046\tvalid-auc:0.63716\n", "[512]\ttrain-auc:0.67053\tvalid-auc:0.63716\n", "[513]\ttrain-auc:0.67056\tvalid-auc:0.63717\n", "[514]\ttrain-auc:0.67063\tvalid-auc:0.63716\n", "[515]\ttrain-auc:0.67067\tvalid-auc:0.63717\n", "[516]\ttrain-auc:0.67074\tvalid-auc:0.63716\n", "[517]\ttrain-auc:0.67080\tvalid-auc:0.63717\n", "[518]\ttrain-auc:0.67083\tvalid-auc:0.63718\n", "[519]\ttrain-auc:0.67089\tvalid-auc:0.63717\n", "[520]\ttrain-auc:0.67095\tvalid-auc:0.63719\n", "[521]\ttrain-auc:0.67101\tvalid-auc:0.63718\n", "[522]\ttrain-auc:0.67105\tvalid-auc:0.63719\n", "[523]\ttrain-auc:0.67111\tvalid-auc:0.63720\n", "[524]\ttrain-auc:0.67120\tvalid-auc:0.63723\n", "[525]\ttrain-auc:0.67125\tvalid-auc:0.63725\n", 
"[526]\ttrain-auc:0.67127\tvalid-auc:0.63726\n", "[527]\ttrain-auc:0.67128\tvalid-auc:0.63725\n", "[528]\ttrain-auc:0.67134\tvalid-auc:0.63730\n", "[529]\ttrain-auc:0.67142\tvalid-auc:0.63729\n", "[530]\ttrain-auc:0.67148\tvalid-auc:0.63729\n", "[531]\ttrain-auc:0.67153\tvalid-auc:0.63731\n", "[532]\ttrain-auc:0.67160\tvalid-auc:0.63730\n", "[533]\ttrain-auc:0.67165\tvalid-auc:0.63734\n", "[534]\ttrain-auc:0.67171\tvalid-auc:0.63734\n", "[535]\ttrain-auc:0.67181\tvalid-auc:0.63737\n", "[536]\ttrain-auc:0.67184\tvalid-auc:0.63737\n", "[537]\ttrain-auc:0.67189\tvalid-auc:0.63737\n", "[538]\ttrain-auc:0.67195\tvalid-auc:0.63739\n", "[539]\ttrain-auc:0.67198\tvalid-auc:0.63740\n", "[540]\ttrain-auc:0.67202\tvalid-auc:0.63742\n", "[541]\ttrain-auc:0.67209\tvalid-auc:0.63744\n", "[542]\ttrain-auc:0.67216\tvalid-auc:0.63748\n", "[543]\ttrain-auc:0.67222\tvalid-auc:0.63749\n", "[544]\ttrain-auc:0.67229\tvalid-auc:0.63752\n", "[545]\ttrain-auc:0.67234\tvalid-auc:0.63752\n", "[546]\ttrain-auc:0.67243\tvalid-auc:0.63754\n", "[547]\ttrain-auc:0.67252\tvalid-auc:0.63759\n", "[548]\ttrain-auc:0.67255\tvalid-auc:0.63757\n", "[549]\ttrain-auc:0.67261\tvalid-auc:0.63759\n", "[550]\ttrain-auc:0.67267\tvalid-auc:0.63759\n", "[551]\ttrain-auc:0.67273\tvalid-auc:0.63762\n", "[552]\ttrain-auc:0.67284\tvalid-auc:0.63765\n", "[553]\ttrain-auc:0.67289\tvalid-auc:0.63765\n", "[554]\ttrain-auc:0.67296\tvalid-auc:0.63767\n", "[555]\ttrain-auc:0.67308\tvalid-auc:0.63770\n", "[556]\ttrain-auc:0.67312\tvalid-auc:0.63772\n", "[557]\ttrain-auc:0.67321\tvalid-auc:0.63772\n", "[558]\ttrain-auc:0.67326\tvalid-auc:0.63773\n", "[559]\ttrain-auc:0.67336\tvalid-auc:0.63774\n", "[560]\ttrain-auc:0.67341\tvalid-auc:0.63773\n", "[561]\ttrain-auc:0.67346\tvalid-auc:0.63771\n", "[562]\ttrain-auc:0.67349\tvalid-auc:0.63771\n", "[563]\ttrain-auc:0.67355\tvalid-auc:0.63771\n", "[564]\ttrain-auc:0.67364\tvalid-auc:0.63773\n", "[565]\ttrain-auc:0.67370\tvalid-auc:0.63775\n", 
"[566]\ttrain-auc:0.67375\tvalid-auc:0.63776\n", "[567]\ttrain-auc:0.67382\tvalid-auc:0.63778\n", "[568]\ttrain-auc:0.67391\tvalid-auc:0.63780\n", "[569]\ttrain-auc:0.67396\tvalid-auc:0.63782\n", "[570]\ttrain-auc:0.67404\tvalid-auc:0.63782\n", "[571]\ttrain-auc:0.67409\tvalid-auc:0.63785\n", "[572]\ttrain-auc:0.67416\tvalid-auc:0.63791\n", "[573]\ttrain-auc:0.67424\tvalid-auc:0.63792\n", "[574]\ttrain-auc:0.67425\tvalid-auc:0.63791\n", "[575]\ttrain-auc:0.67431\tvalid-auc:0.63791\n", "[576]\ttrain-auc:0.67438\tvalid-auc:0.63791\n", "[577]\ttrain-auc:0.67445\tvalid-auc:0.63793\n", "[578]\ttrain-auc:0.67449\tvalid-auc:0.63792\n", "[579]\ttrain-auc:0.67453\tvalid-auc:0.63792\n", "[580]\ttrain-auc:0.67459\tvalid-auc:0.63793\n", "[581]\ttrain-auc:0.67467\tvalid-auc:0.63796\n", "[582]\ttrain-auc:0.67472\tvalid-auc:0.63796\n", "[583]\ttrain-auc:0.67479\tvalid-auc:0.63796\n", "[584]\ttrain-auc:0.67484\tvalid-auc:0.63797\n", "[585]\ttrain-auc:0.67489\tvalid-auc:0.63794\n", "[586]\ttrain-auc:0.67490\tvalid-auc:0.63794\n", "[587]\ttrain-auc:0.67497\tvalid-auc:0.63794\n", "[588]\ttrain-auc:0.67502\tvalid-auc:0.63792\n", "[589]\ttrain-auc:0.67507\tvalid-auc:0.63792\n", "[590]\ttrain-auc:0.67509\tvalid-auc:0.63792\n", "[591]\ttrain-auc:0.67512\tvalid-auc:0.63793\n", "[592]\ttrain-auc:0.67517\tvalid-auc:0.63793\n", "[593]\ttrain-auc:0.67523\tvalid-auc:0.63792\n", "[594]\ttrain-auc:0.67528\tvalid-auc:0.63795\n" ] } ], "source": [ "# 3.6\n", "# train the model\n", "xgb_dask_clf=xgboost.dask.train(client=client, \n", " params=xgb_params, \n", " dtrain=dtrain,\n", " evals=[(dtrain, 'train'), (dtest, 'valid')],\n", " num_boost_round=3500,\n", " early_stopping_rounds=10, \n", " verbose_eval=True\n", " )" ] }, { "cell_type": "code", "execution_count": 26, "id": "179c11e5-208f-44f8-bc6d-d0bd844a944c", "metadata": { "tags": [] }, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# 3.6\n", "# DO NOT CHANGE THIS CELL\n", "# plot history\n", "history=xgb_dask_clf['history']\n", "\n", "plt.plot(history['train']['auc'], label='Train Score (AUC)')\n", "plt.plot(history['valid']['auc'], label='Test Score (AUC)')\n", "\n", "plt.xlabel('Iteration')\n", "plt.ylabel('Model Performance')\n", "plt.legend()\n", "plt.show()" ] }, { "cell_type": "markdown", "id": "96d92840-c3cd-4b9a-b025-c47d1bbb2f32", "metadata": {}, "source": [ "### Step 4: Model Persistence ###\n", "The last step for the assessment is to save the model for grading. You should improve the model until the AUC ROC is above 60%. Once completed, you should submit the model for grading. \n", "\n", "**Instructions**:
\n", "4.1 Execute the cell below to save the model in JSON format.
" ] }, { "cell_type": "code", "execution_count": 27, "id": "633db8ab-7c6c-48fa-ba2c-5c2ff325dd58", "metadata": { "tags": [] }, "outputs": [], "source": [ "# 4.1\n", "# DO NOT CHANGE THIS CELL\n", "# save model\n", "xgb_dask_clf['booster'].save_model('my_assessment/model.json')" ] }, { "cell_type": "markdown", "id": "aab7921b-734c-4cb1-8b91-7ab1b00892cb", "metadata": {}, "source": [ "### Grade Your Code ###\n", "If you have trained the model and completed model evaluation successfully, save changes to the notebook and revisit the webpage where you launched this interactive environment. Click on the \"**ASSESS TASK**\" button as shown in the screenshot below. Doing so will give you credit for this part of the lab that counts towards earning a certificate of competency for the entire course.\n", "\n", "

" ] }, { "cell_type": "markdown", "id": "e9e13c81-abd4-43a2-bfe8-774116b3257f", "metadata": {}, "source": [ "\"Header\"" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3 (ipykernel)", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.9.16" } }, "nbformat": 4, "nbformat_minor": 5 }