Are there Tensorflow pre-trained models on speech/audio? Like there is VGG16 for image, I want to know about the models which will take raw audio as input. If anyone know, please share?
Related
How do I find more info on how the ssd_mobilenet_v1 tflite model on TFHub was trained?
Was it trained in such a way that made it easy to convert it to tflite by avoiding certain ops not supported by tflite? Or was it trained normally, and then converted using the tflite converter with TF Select and the tips on this github issue?
Also, does anyone know if there's an equivalent mobilenet tflite model trained on OpenImagesV6? If not, what's the best starting point for training one?
I am not sure about about the exact origin of the model, but looks like it does have TFLite-compatible ops. From my experience, the best place to start for TFLite-compatible SSD models is with the TF2 Detection Zoo. You can convert any of the SSD models using these instructions.
To train your own model, you can follow these instructions that leverage Google Cloud.
I am using deepLab to generate semantic segmentation masked images for a video in cityscapes datasets. So, I started with the pre-trained model xception65_cityscapes_trainfine provided on the modelzoo and trained it further on the dataset.
I am curious to know How I can start training it from scratch? and not end up just using the pre-trained model? could anyone suggest a direction on How I can achieve it?
Any contribution from the community will be helpful and appreciated.
I am currently trying to train google inception v3 model with TF learn. Tensorflow has provided the trained model over imagenet dataset. But it is not in a usual .tflearn format. I have gone through the documentation of tflearn, but I cannot find any description to convert tensorflow weights into TFlearn format. I know that the right place to ask this question is tflearn gihub page. But it is currently very busy with high number of open issues.
Could anyone point me to the correct direction?
I have a resnet model trained in caffe. I want to try that with show_and_tell instead of inception_v3 image model.
What is the best way to convert the model form caffe to tensorflow
model ?
How would I go about switching the inception model to resnet
?
I'm not sure about the "best" way, but there are some tools that seem to address this, like https://github.com/ethereon/caffe-tensorflow
Can anyone give an example of how to fine tune a pretrained imagenet network with new data and different classes similar to this:
Fine-tuning a Pretrained Network for Style Recognition
This TensorFlow tutorial describes how to retrain a image classifier for new data and new classes.