How to evaluate AI/ML ?
Learn with YouTube and AI
With the widespread of AI since early 2023, I have tried a little bit of new approach of study/learning utilizing various AI solutions. In this section, I am trying to pick up some of the YouTube materials that looks informative (at least) to me. The contents that I am sharing in this section is created as follows.
- Watch the full contents of YouTube material myself
-
NOTE : This is essential since there are a lot of visual material that cannot be shared by the summary and also some details not captured by summary. If you skip this step, nothing would go through your brain... it would just go through YouTube and directly through AI. Then, AI would learn but you would not :) - Get the transcript from YouTube (As of 2023, YouTube provide the built-in function to generate the transcript for the video)
- Copy the transcribe, save it into a text file. Paste the text file into chatGPT (GPT 4) and requested summary (NOTE : If you do not subscribe chatGPT paid version, you may try it with claude ai.)
Reference
- What is model performance evaluation? - fiddler
- Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings - LMSYS ORG (May 2023)
- LLM Evaluation: How To Build, Benchmark Evals - Arize (Oct 2023)
- How to Evaluate a Large Language Model (LLM) ? - Analytics Vidhya (Nov 2023)
- Large Language Model Evaluation in 2024: 5 Methods - AIMultiple (Jan 2024)
YouTube
- Machine Learning Model Evaluation Metrics - Anaconda, Inc (2019)
- How to evaluate ML models | Evaluation metrics for machine learning - AssemblyAI (2022)
- Machine Learning Evaluation - Computational Thinking (2022)