The document describes a Python script that reads a CSV file into a pandas DataFrame and drops specific columns. It then calculates k-means clustering on the numeric features of the DataFrame, determines the optimal number of clusters using the Elbow method, and applies k-means clustering to assign clusters to the data. Finally, it visualizes the clusters using a scatter plot.


import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import sklearn

df = pd.read_csv('/content/Champo data clustering.csv')

df

[Colab interactive DataFrame output. df has 45 rows and 15 columns: Row Labels (string customer codes such as "T-4", "L-4", "L-5"); the numeric totals Sum of QtyRequired (min 2, max 183,206), Sum of TotalArea (min 1.35, max 209,725.222) and Sum of Amount (min 328.8752, max 11,341,052.51); the per-product quantity columns DURRY, HANDLOOM, DOUBLE BACK, JACQUARD, HAND TUFTED, HAND WOVEN, KNOTTED, GUN TUFTED, Powerloom Jacquard and INDO TEBETAN; and an entirely empty Unnamed: 14 column.]

# prompt: generate a code to drop Row Labels and Unnamed: 14 columns

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import sklearn
df = pd.read_csv('/content/Champo data clustering.csv')

# Drop 'Row Labels' and 'Unnamed: 14' columns if they exist
if 'Row Labels' in df.columns:
    df = df.drop('Row Labels', axis=1)
if 'Unnamed: 14' in df.columns:
    df = df.drop('Unnamed: 14', axis=1)

df

[Colab interactive DataFrame output. df still has 45 rows but now 13 columns: Row Labels and Unnamed: 14 have been dropped, leaving Sum of QtyRequired, Sum of TotalArea, Sum of Amount and the ten per-product quantity columns.]
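The two if-guards above can also be written in one call: pandas DataFrame.drop accepts errors='ignore', which silently skips labels that are not present. A minimal sketch (df_demo is a made-up stand-in frame, not the Champo data):

```python
import pandas as pd

# Toy frame standing in for the Champo data, with the two unwanted columns.
df_demo = pd.DataFrame({'Row Labels': ['T-4'],
                        'DURRY': [1021],
                        'Unnamed: 14': [float('nan')]})

# errors='ignore' skips any listed column that does not exist,
# so this is safe to re-run after the columns are already gone.
df_demo = df_demo.drop(columns=['Row Labels', 'Unnamed: 14'],
                       errors='ignore')

print(list(df_demo.columns))  # ['DURRY']
```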

# prompt: generate a code to calculate kMeans from the above data

from sklearn.cluster import KMeans

# Select features for clustering (excluding non-numeric columns if any)
features = df.select_dtypes(include=np.number)

# Determine the optimal number of clusters (e.g., using the Elbow method)
wcss = []
for i in range(1, 11):
    kmeans = KMeans(n_clusters=i, init='k-means++', max_iter=300,
                    n_init=10, random_state=0)
    kmeans.fit(features)
    wcss.append(kmeans.inertia_)

plt.plot(range(1, 11), wcss)
plt.title('Elbow Method')
plt.xlabel('Number of Clusters')
plt.ylabel('WCSS')
plt.show()
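Note that the script clusters on the raw columns. Because k-means minimizes squared Euclidean distance, columns on very different scales (Sum of Amount reaches into the millions while GUN TUFTED tops out at 195) can dominate the result. A minimal sketch of a common refinement, standardizing features first (this is an addition, not part of the original script; df_demo is a made-up stand-in):

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler
from sklearn.cluster import KMeans

# Small synthetic frame standing in for two Champo columns.
df_demo = pd.DataFrame({
    'Sum of QtyRequired': [2466, 131, 18923, 624, 464],
    'Sum of Amount': [185404.1, 6247.46, 1592080.0, 14811.16, 58626.86],
})

# StandardScaler rescales each column to zero mean and unit variance,
# so both columns contribute comparably to the Euclidean distances.
scaler = StandardScaler()
scaled = scaler.fit_transform(df_demo)

print(scaled.mean(axis=0).round(6))  # ~[0, 0]
print(scaled.std(axis=0).round(6))   # ~[1, 1]

# Cluster on the scaled matrix instead of the raw frame.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = kmeans.fit_predict(scaled)
print(len(labels))  # one label per row: 5
```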

# Based on the Elbow method plot, choose the optimal number of clusters (e.g., k=3)
optimal_k = 3  # Replace with the value you determined from the plot

# Apply k-means clustering
kmeans = KMeans(n_clusters=optimal_k, init='k-means++', max_iter=300,
                n_init=10, random_state=0)
df['cluster'] = kmeans.fit_predict(features)

# Print or visualize the results
print(df.head())

# Visualize the clusters (example for 2D data)
# If you have more than 2 features, you may need to use dimensionality
# reduction techniques like PCA first
if len(features.columns) >= 2:
    plt.scatter(features.iloc[:, 0], features.iloc[:, 1],
                c=df['cluster'], cmap='viridis')
    plt.scatter(kmeans.cluster_centers_[:, 0],
                kmeans.cluster_centers_[:, 1], s=300, c='black', marker='*',
                label='Centroids')
    plt.title('Clusters of customers')
    plt.xlabel('Feature 1')  # Replace with your actual feature name
    plt.ylabel('Feature 2')  # Replace with your actual feature name
    plt.legend()
    plt.grid()
    plt.show()
   Sum of QtyRequired  Sum of TotalArea  Sum of Amount  DURRY  HANDLOOM  \
0                2466          139.5900   1.854041e+05   1021      1445
1                 131         2086.0000   6.247460e+03      0         0
2               18923        53625.6544   1.592080e+06   3585         0
3                 624          202.8987   1.481116e+04    581         0
4                 464         8451.5625   5.862686e+04      0         0

   DOUBLE BACK  JACQUARD  HAND TUFTED  HAND WOVEN  KNOTTED  GUN TUFTED  \
0            0         0            0           0        0           0
1           25       106            0           0        0           0
2          175       714        11716        2116      617           0
3            0         2            0          41        0           0
4          459         5            0           0        0           0

   Powerloom Jacquard  INDO TEBETAN  cluster
0                   0             0        0
1                   0             0        0
2                   0             0        2
3                   0             0        0
4                   0             0        0
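The comment in the cell above suggests PCA when there are more than two features, but the scatter plot actually uses the first two raw columns. A sketch of the PCA route, on synthetic stand-in data (an assumption; the original notebook does not run PCA):

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans

# Synthetic stand-in: 45 rows x 13 numeric features, like the Champo frame.
rng = np.random.default_rng(0)
X = rng.normal(size=(45, 13))

# Cluster in the full 13-dimensional feature space.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X)

# Project both the points and the fitted centroids to 2-D for plotting.
pca = PCA(n_components=2)
X2 = pca.fit_transform(X)
centers2 = pca.transform(kmeans.cluster_centers_)

print(X2.shape)        # (45, 2)
print(centers2.shape)  # (3, 2)
# Then: plt.scatter(X2[:, 0], X2[:, 1], c=labels, cmap='viridis'),
# plus a second scatter for centers2, as in the original cell.
```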

# prompt: generate a code to plot silhouette_scores

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import sklearn
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

# ... (Your existing code for data loading and preprocessing) ...

# Determine the optimal number of clusters (e.g., using the Silhouette method)
silhouette_scores = []
for i in range(2, 11):  # Silhouette score is not defined for a single cluster
    kmeans = KMeans(n_clusters=i, init='k-means++', max_iter=300,
                    n_init=10, random_state=0)
    cluster_labels = kmeans.fit_predict(features)
    silhouette_avg = silhouette_score(features, cluster_labels)
    silhouette_scores.append(silhouette_avg)

# Plot silhouette scores
plt.plot(range(2, 11), silhouette_scores, marker='o')
plt.title('Silhouette Scores for Different Cluster Numbers')
plt.xlabel('Number of Clusters')
plt.ylabel('Silhouette Score')
plt.grid()
plt.show()

# Find the optimal k based on highest silhouette score
optimal_k = np.argmax(silhouette_scores) + 2  # Add 2 because range starts from 2

# Apply k-means clustering with the optimal k
kmeans = KMeans(n_clusters=optimal_k, init='k-means++', max_iter=300,
                n_init=10, random_state=0)
df['cluster'] = kmeans.fit_predict(features)

# ... (rest of your code for visualization)
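For intuition on what is being plotted: the silhouette score averages (b - a) / max(a, b) over all points, where a is a point's mean distance to its own cluster and b its mean distance to the nearest other cluster, so well-separated clusters score near 1 and overlapping ones near 0. A self-contained toy check (made-up data, not the Champo frame):

```python
import numpy as np
from sklearn.metrics import silhouette_score

# Two tight, well-separated blobs with known labels.
X = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
              [10.0, 10.0], [10.1, 10.0], [10.0, 10.1]])
labels = np.array([0, 0, 0, 1, 1, 1])

# Intra-cluster distances are ~0.1 while inter-cluster distances are ~14,
# so the mean silhouette should be very close to 1.
score = silhouette_score(X, labels)
print(score > 0.9)  # True
```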
