visual-speaker-authentication

Visual speaker authentication with random prompt texts by a Multi-task CNN Framework

1. Prepare data

Put the GRID lip-frame directory in ./data. The structure should look like this:

    ./data/GRID/
        lip/
            s1/
                bbaf2n/
                    0.jpeg
                    1.jpeg
                    2.jpeg
                    ...
                    74.jpeg
                ...
            s2/
                ...
            ...
        alignments/
            s1/align/
                bbaf2n.align
                ...
            s2/align/
                ...
            ...
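
Before training, it can help to confirm that the layout above is in place. The following is a minimal sketch, not part of this repository: the function name `list_grid_samples` and its return format are hypothetical, and it assumes that `lip/` and `alignments/` both sit under `./data/GRID/` and that each utterance directory is matched by an `.align` file of the same name.

```python
from pathlib import Path


def list_grid_samples(root="./data/GRID"):
    """Pair each lip-frame directory with its alignment file.

    Hypothetical helper (not from the repository). Returns two lists:
    - samples: (speaker, utterance, n_frames, align_path) tuples
    - missing: frame directories that have no matching .align file
    """
    root = Path(root)
    lip_root = root / "lip"
    align_root = root / "alignments"
    samples, missing = [], []
    if not lip_root.is_dir():
        # Nothing to scan; the caller can treat empty lists as "data not prepared".
        return samples, missing
    for speaker_dir in sorted(p for p in lip_root.iterdir() if p.is_dir()):
        for utt_dir in sorted(p for p in speaker_dir.iterdir() if p.is_dir()):
            # Expected location: alignments/<speaker>/align/<utterance>.align
            align_path = align_root / speaker_dir.name / "align" / (utt_dir.name + ".align")
            if align_path.is_file():
                n_frames = len(list(utt_dir.glob("*.jpeg")))
                samples.append((speaker_dir.name, utt_dir.name, n_frames, align_path))
            else:
                missing.append(utt_dir)
    return samples, missing


if __name__ == "__main__":
    samples, missing = list_grid_samples()
    print(f"{len(samples)} usable samples, {len(missing)} without an alignment file")
```

Running the script from the repository root prints how many utterances have both frames and an alignment file, which makes an incomplete copy of the dataset easy to spot before a long training run.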

2. Train a world model

    python train_lipnet_res3d.py

3. Train a client model

    # For example, train client 25's model with 25 of that speaker's samples:
    python visual_speaker_authentication.py train 24 25
    # Test against all the other speakers in the test set; the 0 is the log file index:
    python visual_speaker_authentication.py test 25 0
