Machine Learning Algorithms and Use Cases

Classified in Computers

Written at on English with a size of 33.21 KB.

AlgorithmTrain or TestUse CasesSupervisedPipe?File TypeCPU or GPU
AutoGluon-Tabulartraining and (optionally) validationAutoGluon-Tabular succeeds by ensembling multiple models and stacking them in multiple layers.YNCSVCPU or GPU (single instance only) M5
BlazingTexttrainText Classification can be used to solve various use-cases like sentiment analysis, spam detection, hashtag predictionYYText file (one sentence per line with space-separated tokens)CPU or GPU (single instance only) M5
CatBoosttraining and (optionally) validationGradientBoosting Regression Neural Network, Gradient Boosting is best useful when the number of dimensions in the data is less , when a simple linear model performs very badly,YNCSVCPU (single instance only)
DeepAR Forecastingtrain and (optionally) testTime series Data. Good for cold start problems where the dataset miught not be available.YNJSON Lines or ParquetCPU or GPU
Factorization Machinestrain and (optionally) testSupervised algo for sparse datasets. RMSE for regression, log loss for binary classificationYYrecordIO-protobuf float32CPU (GPU for dense data) m5
Image Classification - MXNettrain and validation, (optionally) train_lst, validation_lst, and modelMXNet offers faster calculation speeds and resource utilisation on GPU.YYrecordIO or image files (.Jpg or .Png)GPU
Image Classification - TensorFlowtraining and validationIn comparison, TensorFlow is inferior; however, the latter performs better on CPU.YFileimage files (.Jpg, .Jpeg, or .Png)CPU or GPU
IP Insightstrain and (optionally) validationFlagging IP addressesNFileCSVCPU or GPU
K-Meanstrain and (optionally) testClusteringNYrecordIO-protobuf or CSVCPU or GPUCommon (single GPU device on one or more instances)
K-Nearest-Neighbors (k-NN)train and (optionally) testText mining, Facial recognition. Good for small dataset. Requires feature scaling.YYrecordIO-protobuf or CSVCPU or GPU (single GPU device on one or more instances)
LDAtrain and (optionally) testText classifiction. Uses Statistics.YYrecordIO-protobuf or CSVCPU (single instance only)
LightGBMtraining and (optionally) validationGradient boosting frameworkYFileCSVCPU (single instance only)
Linear Learnertrain and (optionally) validation, test, or bothregression or classificationYYrecordIO-protobuf or CSVCPU or GPU
Neural Topic Modeltrain and (optionally) validation, test, or bothText classifiction. Uses Neural Networks. Better.YYrecordIO-protobuf or CSVCPU or GPU
Object2Vectrain and (optionally) validation, test, or bothcan analyze images or paragraphs and provdide relationships between themYFileJSON LinesCPU or GPU (single instance only)
Object Detectiontrain and validation, (optionally) train_annotation, validation_annotation, and modelYYrecordIO or image files (.Jpg or .Png)GPU
PCAtrain and (optionally) testDimensionality ReductionNYrecordIO-protobuf or CSVCPU or GPU
Random Cut Foresttrain and (optionally) testOutliers and ForecastingNYrecordIO-protobuf or CSVCPU
Semantic Segmentationtrain and validation, train_annotation, validation_annotation, and (optionally) label_map and modelImage classification with no boundaries. Autonomous vehicles.YYImage filesGPU (single instance only)
Seq2Seq Modelingtrain, validation, and vocabsolve complex Language problems like Machine Translation, Question Answering, creating Chatbots, Text SummarizationYFilerecordIO-protobuf integer tokens not floatGPU (single instance only)
TabTransformertraining and (optionally) validationTransforming Categorical features to achieve higher accuracyYFileCSVCPU or GPU (single instance only)
XGBoost (0.90-1, 0.90-2, 1.0-1, 1.2-1, 1.2-21)train and (optionally) validationy. It provides parallel tree boosting and is the leading machine learning library for regression, classification, and ranking problems.YYCSV, LibSVM, or ParquetCPU (or GPU for 1.2-1)

Entradas relacionadas: