
Commit Message Generation Using BERT as Encoder and Transformer Decoder

I have used a text generation library called Texar. It is a well-designed library with many useful abstractions; I would describe it as the scikit-learn of text generation problems.

The main idea behind this architecture is transfer learning from pretrained BERT, a masked language model: the encoder part of the Transformer is replaced with a BERT encoder, while the decoder is trained from scratch.
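The actual model here is built with Texar; purely as a library-agnostic sketch of the same encoder swap, the following uses Hugging Face's BertModel as the pretrained encoder and a freshly initialized torch.nn.TransformerDecoder. All names are illustrative, not the repo's code.

# Illustrative sketch only -- the repo's real model is built with Texar.
# Assumes: pip install torch transformers
import torch.nn as nn
from transformers import BertModel

class Bert2Transformer(nn.Module):
    def __init__(self, vocab_size, num_decoder_layers=6):
        super().__init__()
        # Pretrained encoder: weights transferred from BERT.
        self.encoder = BertModel.from_pretrained("bert-base-uncased")
        hidden = self.encoder.config.hidden_size  # 768 for BERT-base
        # Decoder is randomly initialized and trained from scratch.
        layer = nn.TransformerDecoderLayer(d_model=hidden, nhead=12,
                                           batch_first=True)
        self.decoder = nn.TransformerDecoder(layer, num_layers=num_decoder_layers)
        self.tgt_embed = nn.Embedding(vocab_size, hidden)
        self.out_proj = nn.Linear(hidden, vocab_size)

    def forward(self, src_ids, src_mask, tgt_ids):
        # Encode the source with BERT; its hidden states act as memory
        # for the decoder's cross-attention.
        memory = self.encoder(input_ids=src_ids,
                              attention_mask=src_mask).last_hidden_state
        # Causal mask so each target position attends only to earlier ones.
        causal = nn.Transformer.generate_square_subsequent_mask(
            tgt_ids.size(1)).to(tgt_ids.device)
        dec = self.decoder(self.tgt_embed(tgt_ids), memory, tgt_mask=causal)
        return self.out_proj(dec)  # per-token vocabulary logits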

One advantage of Transformer networks is that training is much faster than with LSTM-based models, since Transformers eliminate the sequential processing that recurrent models require.

Transformer-based models also tend to generate more grammatically correct and coherent sentences.

To run the model

wget https://storage.googleapis.com/bert_models/2018_10_18/uncased_L-12_H-768_A-12.zip 
unzip uncased_L-12_H-768_A-12.zip

Place the story and summary files under the data folder with the following names (a hypothetical sample pair is shown after this list):
- train_story.txt
- train_summ.txt
- eval_story.txt
- eval_summ.txt

Each story and each summary must be on a single line (see the sample text given).
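For illustration only (the content below is made up, just to show the one-example-per-line layout), line i of train_story.txt pairs with line i of train_summ.txt:

train_story.txt: removed an unused import of os and fixed a typo in the readme
train_summ.txt:  remove unused import and fix readme typo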

Step 1: Run preprocessing: python preprocess.py

This creates two TFRecord files under the data folder.
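The repo's preprocess.py is the authority here; as a rough sketch of what writing such a TFRecord file involves (the feature names below are assumptions, not necessarily the ones the repo uses):

# Hedged sketch: serializing paired text files to a TFRecord.
# The actual tokenization and field names live in preprocess.py.
import tensorflow as tf

def write_tfrecord(story_path, summ_path, out_path, encode_fn):
    # encode_fn: text -> list of BERT vocabulary ids (e.g. a WordPiece
    # tokenizer loaded from the downloaded uncased_L-12_H-768_A-12 vocab).
    with tf.io.TFRecordWriter(out_path) as writer, \
         open(story_path) as stories, open(summ_path) as summs:
        for story, summ in zip(stories, summs):
            feature = {
                "src_ids": tf.train.Feature(
                    int64_list=tf.train.Int64List(value=encode_fn(story))),
                "tgt_ids": tf.train.Feature(
                    int64_list=tf.train.Int64List(value=encode_fn(summ))),
            }
            example = tf.train.Example(
                features=tf.train.Features(feature=feature))
            writer.write(example.SerializeToString())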

Step 2: Train the model: python main.py

Configurations for the model can be changed in the config.py file.
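The exact option names are defined in the repo's config.py; purely as an illustration of the kind of knobs such a file exposes (every name and value below is an assumption, not the actual configuration):

# Illustrative only -- see the repo's config.py for the real options.
max_seq_length = 128        # truncate source/target token sequences
train_batch_size = 32
learning_rate = 2e-5        # small LR, typical when fine-tuning BERT
num_epochs = 10

# Decoder hyperparameters (trained from scratch).
decoder_hparams = {
    "num_blocks": 6,        # decoder layers
    "num_heads": 12,
    "dim": 768,             # must match the BERT-base hidden size
    "dropout": 0.1,
}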

Step 3: Run inference: runipy cmg.ipynb
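runipy executes the notebook non-interactively from the command line; if it is not installed:

pip install runipy
runipy cmg.ipynb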
