Rob Mulla
Rob Mulla
  • 157
  • 11 733 018
Making Sports Predictions with Data Science
🏆 WIN a NVIDIA GeForce RTX 4080 Super GPU! Register now: forms.gle/47vUHzzz2aqJoP1S9
Dive into the fascinating world of machine learning and AI as we guide you through developing a model designed to predict NCAA tournament outcomes. From initial setup to final predictions, we’ll cover everything you need to create your own powerhouse model.
Embark on this machine learning adventure to not only enhance your NCAA bracket but also to deepen your understanding of predictive modeling. Let’s transform your bracket into a data-driven masterpiece together!
📘 Access the Notebook: www.kaggle.com/robikscube/machine-learning-bracket-gpu-powered
Timeline:
00:00 Intro
01:06 Setup and Import
02:39 Data Understanding
05:23 Data Pipeline Goal
06:04 Step 1- Season Data
11:07 Step 2- Tourney Data
15:30 Chalk Baseline
17:51 XGBoost ML Model
21:30 Predicting Matchups
23:50 Creating the Bracket
Links to my stuff:
* UA-cam: youtube.com/@robmulla?sub_confirmation=1
* Discord: discord.gg/HZszek7DQc
* Twitch: www.twitch.tv/RobCodesLIVE
* Twitter: Rob_Mulla
* Kaggle: www.kaggle.com/robikscube
Переглядів: 14 886

Відео

Build Your First Pytorch Model In Minutes! [Tutorial + Code]
Переглядів 66 тис.8 місяців тому
Build Your First Pytorch Model In Minutes! [Tutorial Code]
Learning Pandas for Data Analysis? Start Here.
Переглядів 81 тис.9 місяців тому
Learning Pandas for Data Analysis? Start Here.
ASL Fingerspelling Demo - #kaggle Competition Solution
Переглядів 8 тис.9 місяців тому
ASL Fingerspelling Demo - #kaggle Competition Solution
100% Offline ChatGPT Alternative?
Переглядів 625 тис.11 місяців тому
100% Offline ChatGPT Alternative?
SQL Databases with Pandas and Python - A Complete Guide
Переглядів 110 тис.Рік тому
SQL Databases with Pandas and Python - A Complete Guide
Unbelievable Face Swapping with 5 Lines Code
Переглядів 68 тис.Рік тому
Unbelievable Face Swapping with 5 Lines Code
Predict My Sleep with Data Science.
Переглядів 7 тис.Рік тому
Predict My Sleep with Data Science.
Pandas 2.0 : Everything You Need to Know
Переглядів 120 тис.Рік тому
Pandas 2.0 : Everything You Need to Know
The BEST library for building Data Pipelines...
Переглядів 70 тис.Рік тому
The BEST library for building Data Pipelines...
Open Source Face Analysis with Python
Переглядів 47 тис.Рік тому
Open Source Face Analysis with Python
Polars: The Next Big Python Data Science Library... written in RUST?
Переглядів 166 тис.Рік тому
Polars: The Next Big Python Data Science Library... written in RUST?
The Top 5 A.I. Stories of 2022
Переглядів 4 тис.Рік тому
The Top 5 A.I. Stories of 2022
3 Simple Ways ChatGPT Can Make You a Better Coder
Переглядів 26 тис.Рік тому
3 Simple Ways ChatGPT Can Make You a Better Coder
ChatGPT can do what? Goodbye Stack Overflow...
Переглядів 7 тис.Рік тому
ChatGPT can do what? Goodbye Stack Overflow...
Scrape Twitter with 5 Lines of Code
Переглядів 66 тис.Рік тому
Scrape Twitter with 5 Lines of Code
Run Stable Diffusion 2.0 Locally & Create AI Art
Переглядів 12 тис.Рік тому
Run Stable Diffusion 2.0 Locally & Create AI Art
Forecasting with the FB Prophet Model
Переглядів 73 тис.Рік тому
Forecasting with the FB Prophet Model
The Ultimate Coding Setup for Data Science
Переглядів 106 тис.Рік тому
The Ultimate Coding Setup for Data Science
Data Visualization BATTLE!
Переглядів 18 тис.Рік тому
Data Visualization BATTLE!
Yolov7 Custom Object Detection in Python Tutorial - Chess Piece Detection
Переглядів 53 тис.Рік тому
Yolov7 Custom Object Detection in Python Tutorial - Chess Piece Detection
OpenAI Whisper Demo: Convert Speech to Text in Python
Переглядів 100 тис.Рік тому
OpenAI Whisper Demo: Convert Speech to Text in Python
25 Nooby Pandas Coding Mistakes You Should NEVER make.
Переглядів 261 тис.Рік тому
25 Nooby Pandas Coding Mistakes You Should NEVER make.
Kaggle Dataset Creation from Scratch- Data Science Uncut (Aug 10 2022)
Переглядів 7 тис.Рік тому
Kaggle Dataset Creation from Scratch- Data Science Uncut (Aug 10 2022)
Data Science Uncut - Data Shootout Kaggle Competition (Aug 1 2022 Stream)
Переглядів 29 тис.Рік тому
Data Science Uncut - Data Shootout Kaggle Competition (Aug 1 2022 Stream)
Time Series Forecasting with XGBoost - Advanced Methods
Переглядів 112 тис.Рік тому
Time Series Forecasting with XGBoost - Advanced Methods
Do these Pandas Alternatives actually work?
Переглядів 14 тис.Рік тому
Do these Pandas Alternatives actually work?
Solving an Impossible Riddle with Code
Переглядів 7 тис.Рік тому
Solving an Impossible Riddle with Code
Detect Text in Images with Python - pytesseract vs. easyocr vs keras_ocr
Переглядів 102 тис.Рік тому
Detect Text in Images with Python - pytesseract vs. easyocr vs keras_ocr
Time Series Forecasting with XGBoost - Use python and machine learning to predict energy consumption
Переглядів 380 тис.Рік тому
Time Series Forecasting with XGBoost - Use python and machine learning to predict energy consumption

КОМЕНТАРІ

  • @jvilbre
    @jvilbre 19 годин тому

    list in python is also a stack

  • @seanflorida5957
    @seanflorida5957 День тому

    Just getting adict to your contens now. Sharp and easy to follow. Thank you very much

  • @carmizadok5491
    @carmizadok5491 День тому

    Amazing

  • @1811artur
    @1811artur День тому

    very usefull

  • @Arkantosi
    @Arkantosi 2 дні тому

    Rob, are you aware that you have made a crucial forecasting mistake? You used the test set for validating the model when fitting, then you used the same test set when you made the final predictions and evaluated it on the same set. The problem is that during the fitting, the model gets to see the test set so you have data leaked into the past, from the future. What you should do is to split the data into train/val/test where the test has never been seen by the model.

  • @syalin4433
    @syalin4433 2 дні тому

    You are trying too hard

  • @user-kl5nx9qy8o
    @user-kl5nx9qy8o 3 дні тому

    What about using these cross validation objects within GridSearchCV, how would you treat the creation of new features for each fold? Would you have to sacrifice using GridSearchCV to be able to apply feature engineering with aggregate data on each fold? Your approach allows feature engineerimg on each fold, but it does not allow the benefit of hyperparameter tuning as with gridsearchcv. Would the best advice be to use your approach of cross validation and apply in each fold loop with feature engineering, but with the drawback of relying on manual hyperparameter tuning instead f methods as gridsearchcv or randomsearchcv?

  • @DmiHindi
    @DmiHindi 3 дні тому

    Really must watch video. I must say that I really like the way you explain things, it's so easy to understand and follow along. Thank you!

  • @user-kl5nx9qy8o
    @user-kl5nx9qy8o 3 дні тому

    Amazing. Really covered very well most of my questions from the firat video. This is the best video I have seen so far about xgboost that integrates time series data with feature engineering that uses aggregate data and shows clearly how to use it with cross-validation appropriately to avoid data leakage. I would make a 3rd part where you focus on integrating everything within a preprocessing pipeline along with hyperparameter tuning using GridSearchCV or any other method in order to make it more robust. Either way you showed me a new way to use

  • @user-iz7kw1si2e
    @user-iz7kw1si2e 3 дні тому

    Thanks! U remember me about enumerate function❤

  • @user-kl5nx9qy8o
    @user-kl5nx9qy8o 3 дні тому

    Thanks so much for this video. It would be cool as well to see a video with xgboost mainly about feature engineering using aggregate data(for example the average of the last 30 days) while using cross validation appropriately to avoid data leakage. Would hyperparameter tuning with GridSearchCV would have to be sacrificed since you can't easily control creating these features using aggregate data within each dataset split made in the cross-validation? Thanks so much for your enlightening and amazing videos. I highly appreciate your work.

  • @giraffa-analytics
    @giraffa-analytics 3 дні тому

    Linear regression is a bad choice for this kind of data. This is a time series problem. cumulative sums introduce autocorrelation between datapoints, which violates the assumption of iid residuals of linear regression. Code ok, reasoning not.

  • @pompymandislian5628
    @pompymandislian5628 3 дні тому

    i think predict forecasthing not suitable with XGboost model, because xgboost just using method like decision tree (it's not interpretation), if you want using machine learning I recommend using deep learning like LSTM, GRU, RNN

  • @kisho2679
    @kisho2679 4 дні тому

    how can an external LaTex file be called/included/embedded in a JupyterLab cell?

  • @Lnd2345
    @Lnd2345 5 днів тому

    What does the max function do? Doesn’t each video have a single view count?

  • @chillyhumor
    @chillyhumor 5 днів тому

    Where is or where to take the dataset

  • @AI_Reflection
    @AI_Reflection 6 днів тому

    Hello again

  • @itsfarseen
    @itsfarseen 6 днів тому

    Don't watch. Waste of time. The whole video is just import timm; timm.Model().forward();

  • @itsfarseen
    @itsfarseen 6 днів тому

    Don't watch. Waste of time. The whole video is just import timm; timm.Model().forward();

    • @itsfarseen
      @itsfarseen 6 днів тому

      timm is a library with already made models. He basically wrote two lines of code. He doesn't show how to make a model

  • @farhadnikhashemi8681
    @farhadnikhashemi8681 6 днів тому

    Your shared videos are fantastic, Rob. Well done. By the way, I applied the above, but I was unable to scrape the data, assuming that Twitter does not allow escape without API. If anyone has done this successfully, please advise.

  • @itsfarseen
    @itsfarseen 6 днів тому

    You robbed my precious 2 hrs rob mulla. I watched your How to Pytorch video hoping you'd show me how to build a model using Pytorch. Then I watched this video as well. Thanks for scamming me of my precious time and attention. You just showed how to import timm and call forward() on it. You didn't teach me how to make a model using Pytorch. Thank you so much 🙏 keep stealing everyone's time.

  • @Champalampa
    @Champalampa 6 днів тому

    Woops I’m a double noob

  • @kushagra54
    @kushagra54 7 днів тому

    Amazing video, you made it so simple

  • @siddharthashukla952
    @siddharthashukla952 7 днів тому

    Sorry to say but you should give more details about how you the process.

  • @cupidonsauce1208
    @cupidonsauce1208 7 днів тому

    Ah yes, writing unconventional code which can't be found in other mains stream languages is somewhat less noob than writing some code which is already clear and can be written in most languages Truly stupid -_- don't call people noob, it's a noob thing to do

  • @Caelghoul
    @Caelghoul 7 днів тому

    In general indexing is slow in df numpy faster

  • @whendoyoulift5602
    @whendoyoulift5602 8 днів тому

    if you are facing error at 4:41 try pd.set_option('display.max_columns',200) instead of pd.set_option('max_columns',200)

  • @BritainW3llz
    @BritainW3llz 8 днів тому

    ways_to_code = ['range', 'counter', 'enumerate'] noob = 'Stop Doing It This Way' for funny_jokes in ways_to_code: if funny_jokes == 'range' or funny_jokes =='counter': print(noob) else: print('Okay, you\'re alright')

  • @TTOmiTTom
    @TTOmiTTom 8 днів тому

    Rob please don't stop making videos. I just discovered your channel and my universe just expanded!!! Awesome stuff!

  • @vt100music
    @vt100music 8 днів тому

    hey, does deepface work without the internet? like can i install it on a machine, then unplug my ethernet cable, and it will still work or does it need to talk to the cloud somehow?

  • @_RahulDewangan
    @_RahulDewangan 8 днів тому

    This video came many times in my home page recommendation but i was ignoring cause it's just a 22 min video.....but boom 🤯 ! Omg not even a 2 hours video of others cover all these.❤ thanks

    • @robmulla
      @robmulla 8 днів тому

      Love that! Share it with a friend or two.

  • @ryanblumenow
    @ryanblumenow 8 днів тому

    Can this talk with, summarise; and analyse excel or csv files?

  • @SiracEditz4427
    @SiracEditz4427 9 днів тому

    I know ur sons name is Daniel

  • @Xsnake237
    @Xsnake237 9 днів тому

    ur kids name is daniel

  • @Xsnake237
    @Xsnake237 9 днів тому

    ur not safe

  • @Xsnake237
    @Xsnake237 9 днів тому

    they live in potomac maryland

  • @Xsnake237
    @Xsnake237 9 днів тому

    ur kids name is daniel

  • @amazman977
    @amazman977 9 днів тому

    Thanks. Easy to follow you.

  • @user-sy4ec3em5o
    @user-sy4ec3em5o 10 днів тому

    Do you code in python... learn a real language like C and C++... Then go back to python and you will be irritated relentlessly by the loosely typed nature of the language and you will go back to C because it makes you feel like a boss

  • @bandhalaraja3481
    @bandhalaraja3481 10 днів тому

    Better we can pass the value into ChartJS and create whatever you want. It's been 2 years, I try to build a good dashboards using flask, I never find any good solutions yet.

  • @chasegillis5981
    @chasegillis5981 10 днів тому

    Learned a lot, subscribed 🤙🏻

  • @Chrollo-gk3oe
    @Chrollo-gk3oe 11 днів тому

    Don’t ducking judge me. I identify as a professional👍

  • @AnastasiyaKaliutchyk
    @AnastasiyaKaliutchyk 11 днів тому

    Just brilliant! Thank you so much!

    • @robmulla
      @robmulla 11 днів тому

      You're very welcome!

  • @samranbeytollahi
    @samranbeytollahi 11 днів тому

    Thanks can I ask what editor do you use?in this video

  • @barmalini
    @barmalini 11 днів тому

    I really hate these stock video interruptions

    • @robmulla
      @robmulla 11 днів тому

      What do you mean? The advertisements from youtube?

  • @poojarai7336
    @poojarai7336 12 днів тому

    hii can i do the same in jupyter notebook? as i work on that

  • @RamilTeodosio
    @RamilTeodosio 12 днів тому

    You got me at "I'm a data scientist"

  • @ChopLancer
    @ChopLancer 13 днів тому

    So if i change my code tomorrow I won't be a double noob anymore?

  • @NS-cp5bx
    @NS-cp5bx 13 днів тому

    Thank you Rob!! Very nice explanation of Jupyter options and environments!

  • @onyinyeobijiofor7075
    @onyinyeobijiofor7075 13 днів тому

    I loved it. Thank you.