Article
-
Chapter 11
[AI from Scratch] Episode 310: Hidden Markov Model (HMM) — Classical Speech Recognition Model Explained
Recap and Today's Theme Hello! In the previous episode, we explored the basics of speech recognition, understanding how audio is converted into text and how different components like acoustic models and language models work together. Tod... -
Chapter 11
[AI from Scratch] Episode 308: Preprocessing Audio Data — Normalization and Filtering Techniques
Recap and Today's Theme Hello! In the previous episode, we discussed noise reduction, learning various techniques to remove noise from audio data to improve its quality. Noise reduction is an essential step in enhancing the overall clari... -
Chapter 11
[AI from Scratch] Episode 309: Introduction to Speech Recognition — Converting Speech into Text
Recap and Today's Theme Hello! In the previous episode, we covered audio preprocessing techniques such as normalization, filtering, and adjusting the sampling rate. These steps are essential for improving the quality of audio data and en... -
Chapter 11
[AI from Scratch] Episode 306: Mel-Frequency Cepstral Coefficients (MFCC) — Extracting Audio Features
Recap and Today's Theme Hello! In the previous episode, we explored spectrograms, learning how to break down audio signals into frequency components and display them over time. Spectrograms are a crucial tool for visually understanding t... -
Chapter 11
[AI from Scratch] Episode 307: Noise Reduction — Techniques for Removing Noise from Audio Data
Recap and Today's Theme Hello! In the previous episode, we discussed Mel-Frequency Cepstral Coefficients (MFCC), a crucial tool for extracting features from audio data used in speech recognition and acoustic analysis. Today, we’ll focus ... -
Chapter 11
[AI from Scratch] Episode 304: Visualizing Waveform Data — How to Graph Audio Signals
Recap and Today's Theme Hello! In the previous episode, we explored LibROSA, a powerful Python library for audio processing. We learned how to load, play, and extract features from audio files easily. Now, it's time to move on to a more ... -
Chapter 11
[AI from Scratch] Episode 305: What is a Spectrogram? — Visualizing Frequency Components Over Time
Recap and Today's Theme Hello! In the previous episode, we explored waveform visualization, learning how to display audio signals over time as changes in amplitude. By visualizing waveforms, we could understand the intensity and certain ... -
Chapter 11
[AI from Scratch] Episode 303: Introduction to LibROSA — Basics of the Audio Processing Library
Recap and Today's Theme Hello! In the previous episode, we covered the basics of audio data, discussing key concepts such as sampling rate and bit depth. Understanding these fundamentals allows you to handle audio data properly and manag... -
Chapter 11
[AI from Scratch] Episode 302: Basics of Audio Data — Understanding Sampling Rate and Bit Depth
Recap and Today's Theme Hello! In the previous episode, we discussed audio processing and covered the foundational technologies of digital audio, including speech recognition and speech synthesis. We learned about how audio is digitized ... -
Chapter 11
[AI from Scratch] Episode 301: What is Speech Processing? — A Guide to Working with Audio Data
Recap and Today's Theme Hello! In the previous episode, we summarized Chapter 10 and conducted a knowledge check to review and deepen our understanding. Now, we’re moving into Chapter 11, where we will learn about speech recognition and ... -
Chapter 10
[AI from Scratch] Episode 298: Anomaly Detection — Techniques for Detecting Anomalies in Surveillance Camera Footage
Recap and Today's Theme Hello! In the previous episode, we explored video data analysis, covering various methods for extracting information from videos and their applications. Video analysis is widely used in fields such as surveillance... -
Chapter 10
[AI from Scratch] Episode 299: Challenges and Future Prospects of Computer Vision
Recap and Today's Theme Hello! In the previous episode, we explored anomaly detection techniques, focusing on how to detect unusual behaviors in surveillance videos. This technology plays a crucial role in security systems, using both ru... -
Chapter 10
[AI from Scratch] Episode 297: Analyzing Video Data — Techniques for Extracting Information from Videos
Recap and Today's Theme Hello! In the previous episode, we discussed the processing of point cloud data, focusing on how to acquire 3D scan data, remove noise, and convert the data into a mesh. Point cloud data plays a critical role in v... -
Chapter 10
[AI from Scratch] Episode 295: Fundamentals of 3D Vision — Handling Depth Information
Recap and Today's Theme Hello! In the previous episode, we discussed evaluation metrics in computer vision, such as accuracy, IoU, and mAP, which are crucial for evaluating model performance in various tasks. Today, we will explore the b... -
Chapter 10
[AI from Scratch] Episode 296: Processing Point Cloud Data — How to Handle 3D Scanned Data
Recap and Today's Theme Hello! In the previous episode, we introduced the basics of 3D vision, discussing various methods for obtaining depth information such as stereo vision, LiDAR, and ToF cameras. These technologies allow us to captu...
