Substantial

Machine Learning Series

A collection of three machine learning projects exploring evolutionary algorithms, natural language processing, and audio classification techniques

Timeline

September 2021May 2022

Key Technologies

PythonTensorFlowPyTorchComputer Vision+4 more

Project Overview

This series represents my exploration into different domains of machine learning, from computer vision and robotics to natural language processing and audio analysis. Each project tackles unique challenges and demonstrates different ML approaches and techniques.
Machine Learning Series - exploring computer vision, NLP, and audio processing

Machine Learning Series - exploring computer vision, NLP, and audio processing

Project 1: Dexterous Tree

The "Dexterous Tree" robot is a segmented pillar with a neural network that developed using evolutionary techniques to dodge objects in its environment when falling. This project explores how evolutionary algorithms can train neural networks for dynamic obstacle avoidance and adaptive movement in robotic systems.

Click to view documentation Dexterous manipulation system demonstrating advanced robotic interaction capabilities.

Project 2: Sound Classification

This sound classification algorithm utilizes a convolutional neural network to distinguish the source and environment of sampled audio clips. The system demonstrates the application of deep learning to acoustic pattern recognition, enabling automatic identification and categorization of different audio signals based on their contextual characteristics.

Click to view documentation CNN architecture design for audio classification showing feature extraction and classification layers

Project 3: Lyric Generation (GAN)

A natural language processing project that uses bidirectional LSTMs to learn an artist's style from their most popular work and reproduce a new set of lyrics for the user. This system analyzes patterns in creative writing and generates coherent, stylistically consistent text that mimics the learned artistic style.

Click to view documentation Training progression showing output quality improvement across epochs in the lyric generation GAN

Technical Approach

Each project represents a different branch of machine learning: computer vision for robotic manipulation, generative models for creative content, and signal processing for audio classification. Together, they demonstrate the versatility and wide-ranging applications of modern ML techniques.

Key Learning Outcomes

This series provided hands-on experience with multiple ML frameworks, data preprocessing techniques, model architecture design, and performance optimization. The projects also highlighted the importance of domain-specific considerations when applying machine learning to real-world problems.

Key Features

  • Three distinct ML domains explored
  • Computer vision for robotic manipulation
  • GAN-based creative text generation
  • Audio classification using CNNs
  • Cross-domain ML technique comparison
  • End-to-end project development

Technologies

Python
TensorFlow
PyTorch
Computer Vision
NLP
Audio Processing
GANs
CNNs