KaiquanMah
/

VAE-Banking77-OpenIntentClassification

Model card Files Files and versions

VAE-Banking77-OpenIntentClassification / README.md

KaiquanMah's picture

Update README.md

ee57c6d verified 5 months ago

|

history blame contribute delete

1.26 kB

	---
	license: mit

	datasets:
	- PolyAI/banking77

	language:
	- en

	tags:
	- autoencoder
	---

	# VAE trained on Banking 77 Open Intent Classification Dataset
	This is a Variational Autoencoder (VAE) trained on the [PolyAI/banking77](https://huggingface.co/datasets/PolyAI/banking77) dataset.

	### Architecture
	- input_dim: 768
	- hidden_dim: 256
	- latent_dim: 64

	#### Encoder
	The encoder maps the input to a latent space distribution.

	```python
	encoder = nn.Sequential(
	nn.Linear(input_dim, hidden_dim),
	nn.ReLU()
	)

	mu = nn.Linear(hidden_dim, latent_dim)
	logvar = nn.Linear(hidden_dim, latent_dim)
	```

	#### Decoder
	The decoder reconstructs the input from a sample of the latent space.

	```python
	decoder = nn.Sequential(
	nn.Linear(latent_dim, hidden_dim),
	nn.ReLU(),
	nn.Linear(hidden_dim, input_dim)
	)
	```

	#### Metrics
	The model was trained and evaluated using the following metrics:
	1. Training set: VAE Loss
	* 50% reconstruction loss between original input vs reconstructed output
	* 50% KL divergence between Latent Z vs standard normal distribution
	2. Validation set: 100% reconstruction loss -> used to find the best model (with the lowest reconstruction loss)