Wafer Scratch Detection - Deep Learning Solution

This project implements a deep learning pipeline to detect scratches in semiconductor wafers using a custom U-Net model with fixed coordinate image generation and flexible training/prediction flows.

The model class is designed to be simple to use, encapsulated in a single Python class named Model, located in Model/model.py.

🔧 How to Use

1. Import the Model Class

from Model.model import Model

2. Initialize the Model with Training Data

model = Model(df_train=df_wafers)

3. Generate Image Data from Wafer Map

model.makeData()  # Converts tabular data into 256x256 fixed-coordinate .pt images

4. Train the Model

model.train(epoch=10)

5. Make Predictions

IsScratchDie = model.predict(df_wafers_test, yield_threshold=0.7)
df_wafers_test['IsScratchDie'] = IsScratchDie

6. Evaluate Results

model.evaluate_die_level(df_wafers['IsScratchDie'].values, IsScratchDie)

📁 Folder Structure

project_root/
├── Model/
│   ├── model.py           # Model class with training and inference logic
│   ├── SaveModel/         # Stores best_model.pth, optimizer, val loss
├── datamap/
│   ├── Waferclass.py      # Dataset and caching logic
│   └── data/cache/        # Cached training images and labels
│   └── data/test/         # Cached test images

Key Features

Fixed Coordinate Image Grid

Each wafer is converted to a 256x256 tensor.
Coordinates are mapped directly based on DieX, DieY offset.
This improves stability and removes ambiguity when mapping predictions back to wafer-level CSV.

Training Loop

Saves best model, loss, and optimizer states.
Early stopping with patience.
Automatically resumes training if continue_training=True.

Wafer Filtering

Removes low-yield wafers (e.g., yield < 0.7) before predicting.

✅Batch Prediction Flow

Loads preprocessed images.
Uses the trained model to predict per-die scratch detection.
Maps prediction back to CSV coordinates.

📊 Metrics

Model evaluation is performed at the die level:

Accuracy
Precision
Recall
F1-score

Results are printed and also saved to a CSV:

model.evaluate_die_level(true_labels, pred_labels, save_path="metrics.csv")

🧪 Experiments & Observations

🔁 Input Format Iterations

Original attempts used stretched resized 256x256 images.
This introduced distortions during back-conversion to die coordinates.
🔁 Final solution uses fixed coordinate grid: a pixel is drawn only if a die exists in that coordinate.

🔁 Prediction Mapping

Prediction results in a [2, 256, 256] tensor (2 channels: [prob_not_scratch, prob_scratch]).
Argmax is applied along channel dim to extract final binary scratch mask.

🔁 Model Improvements

CrossEntropyLoss with weighting improves imbalance.
Batch size and DataLoader optimizations improve GPU utilization.
Patience-based early stopping prevents overfitting.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Model		Model
datamap		datamap
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wafer Scratch Detection - Deep Learning Solution

🔧 How to Use

1. Import the Model Class

2. Initialize the Model with Training Data

3. Generate Image Data from Wafer Map

4. Train the Model

5. Make Predictions

6. Evaluate Results

📁 Folder Structure

Key Features

Fixed Coordinate Image Grid

Training Loop

Wafer Filtering

✅Batch Prediction Flow

📊 Metrics

🧪 Experiments & Observations

🔁 Input Format Iterations

🔁 Prediction Mapping

🔁 Model Improvements

📥 Output Example

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Wafer Scratch Detection - Deep Learning Solution

🔧 How to Use

1. Import the Model Class

2. Initialize the Model with Training Data

3. Generate Image Data from Wafer Map

4. Train the Model

5. Make Predictions

6. Evaluate Results

📁 Folder Structure

Key Features

Fixed Coordinate Image Grid

Training Loop

Wafer Filtering

✅Batch Prediction Flow

📊 Metrics

🧪 Experiments & Observations

🔁 Input Format Iterations

🔁 Prediction Mapping

🔁 Model Improvements

📥 Output Example

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages