Upload LSTM model and tokenizer

Browse files

Files changed (9) hide show

.gitattributes +1 -0
README.md +80 -0
config.json +0 -0
lstm_model/fingerprint.pb +3 -0
lstm_model/keras_metadata.pb +3 -0
lstm_model/saved_model.pb +3 -0
lstm_model/variables/variables.data-00000-of-00001 +3 -0
lstm_model/variables/variables.index +0 -0
tokenizer.json +0 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+lstm_model/variables/variables.data-00000-of-00001 filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,80 @@

+---
+tags:
+- text-generation
+- lstm
+- tensorflow
+library_name: tensorflow
+pipeline_tag: text-generation
+---
+# LSTM Text Generation Model
+This model was trained using TensorFlow/Keras for financial article generation tasks.
+## Model Details
+- **Model Type**: LSTM
+- **Framework**: TensorFlow/Keras
+- **Task**: Text Generation
+- **Vocabulary Size**: 41376
+- **Architecture**: Long Short-Term Memory (LSTM)
+## Usage
+```python
+from huggingface_hub import snapshot_download
+import tensorflow as tf
+import json
+import pickle
+import numpy as np
+# Download model files
+model_path = snapshot_download(repo_id="firobeid/L4_LSTM_financial_article_generator")
+# Load the LSTM model
+model = tf.keras.models.load_model(f"{model_path}/lstm_model")
+# Load tokenizer
+try:
+    # Try JSON format first
+    with open(f"{model_path}/tokenizer.json", 'r', encoding='utf-8') as f:
+        tokenizer_json = f.read()
+    tokenizer = tf.keras.preprocessing.text.tokenizer_from_json(tokenizer_json)
+except FileNotFoundError:
+    # Fallback to pickle format
+    with open(f"{model_path}/tokenizer.pkl", 'rb') as f:
+        tokenizer = pickle.load(f)
+# Text generation function
+def generate_text(input_text, num_words=10):
+    # Preprocess input
+    X = np.array(tokenizer.texts_to_sequences([input_text])) - 1
+    # Generate predictions
+    output_text = []
+    for i in range(num_words):
+        y_proba = model.predict(X, verbose=0)[0]
+        pred_word_ind = np.argmax(y_proba, axis=-1) + 1
+        pred_word = tokenizer.index_word[pred_word_ind[-1]]
+        input_text += ' ' + pred_word
+        output_text.append(pred_word)
+        X = np.array(tokenizer.texts_to_sequences([input_text])) - 1
+    return ' '.join(output_text)
+# Example usage
+# Start with these tags: <business>, <entertainment>, <politics>, <sport>, <tech>
+result = generate_text("<tech> The future of artificial intelligence", num_words=15)
+print(result)
+```
+## Training
+This model was trained on text data using LSTM architecture for next-word prediction.
+## Limitations
+- Model performance depends on training data quality and size
+- Generated text may not always be coherent for longer sequences
+- Model architecture is optimized for the specific vocabulary it was trained on

config.json ADDED Viewed

The diff for this file is too large to render. See raw diff

lstm_model/fingerprint.pb ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:76dc228eaab1a254fe2875169b46d865f5a2ec20dcd1d73ed32aaf47dad91207
+size 57

lstm_model/keras_metadata.pb ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5608b1773a530424b5af7952ad1b240e0d9fdb4756bc9af171bf950171990f0d
+size 15917

lstm_model/saved_model.pb ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ce7b4c755b3d99d8cafb673e9a78b101af96de90fbe20090486a82174b7c7ee6
+size 1475596

lstm_model/variables/variables.data-00000-of-00001 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:53a39e598a4ee3ee8b93223a92e0b023918a819a51db5087c07029755c4cf8f8
+size 89112282

lstm_model/variables/variables.index ADDED Viewed

Binary file (705 Bytes). View file

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff