annotation~blast-diamond_for_protein

Add annotation of reference cDNAs/Proteins with tblastn/blastp/diamond

input_1:an input cDNA file

input_1/test.fa

>FBgn0004907;CG17870-PD 14-3-3zeta~14-3-3zeta;14-3-3_protein
MSTVDKEELVQKAKLAEQSERYDDMAQAMKSVTETGVELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEASARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKASNPESKVFYLKMKGDYYRYLAEVATGDARNTVVDDSQTAYQDAFDISKGKMQPTHPIRLGLALNFSVFYYEILNSPDKACQLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDEAEPQEGGDN
>FBgn0004907;CG17870-PB 14-3-3zeta~14-3-3zeta;14-3-3_protein
MSTVDKEELVQKAKLAEQSERYDDMAQAMKSVTETGVELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEASARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKASNPESKVFYLKMKGDYYRYLAEVATGDARNTVVDDSKNAYQEAFDIAKTKMQPTHPIRLGLALNFSVFYYEILNSPDKACQLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDEAEPQEGGDN
>FBgn0004907;CG17870-PF 14-3-3zeta~14-3-3zeta;14-3-3_protein
MSTVDKEELVQKAKLAEQSERYDDMAQAMKSVTETGVELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEASARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKASNPESKVFYLKMKGDYYRYLAEVATGDARNTVVEDSKKAYQEAFDIAKTKMQPTHPIRLGLALNFSVFYYEIINSPARACHLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDEAEPQEGGDN
>FBgn0020238;CG31196-PA 14-3-3epsilon~14-3-3epsilon;14-3-3_protein
MTERENNVYKAKLAEQAERYDEMVEAMKKVASMDVELTVEERNLLSVAYKNVIGARRASWRIITSIEQKEENKGAEEKLEMIKTYRGQVEKELRDICSDILNVLEKHLIPCATSGESKVFYYKMKGDYHRYLAEFATGSDRKDAAENSLIAYKAASDIAMNDLPPTHPIRLGLALNFSVFYYEILNSPDRACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMQAEEVDPNAGDGEPKEQIQDVEDQDVS
>FBgn0020238;CG31196-PB 14-3-3epsilon~14-3-3epsilon;14-3-3_protein
MTERENNVYKAKLAEQAERYDEMVEAMKKVASMDVELTVEERNLLSVAYKNVIGARRASWRIITSIEQKEENKGAEEKLEMIKTYRGQVEKELRDICSDILNVLEKHLIPCATSGESKVFYYKMKGDYHRYLAEFATGSDRKDAAENSLIAYKAASDIAMNDLPPTHPIRLGLALNFSVFYYEILNSPDRACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMQAEVDPNAGDGEPKEQIQDVEDQDVS

input_2:reference cDNA files for tblastn

input_3:reference protein files for blastp

input_3/Dm-r6.49.fasta

>FBgn0004907;CG17870-PD 14-3-3zeta~14-3-3zeta;14-3-3_protein
MSTVDKEELVQKAKLAEQSERYDDMAQAMKSVTETGVELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEASARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKASNPESKVFYLKMKGDYYRYLAEVATGDARNTVVDDSQTAYQDAFDISKGKMQPTHPIRLGLALNFSVFYYEILNSPDKACQLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDEAEPQEGGDN
>FBgn0004907;CG17870-PB 14-3-3zeta~14-3-3zeta;14-3-3_protein
MSTVDKEELVQKAKLAEQSERYDDMAQAMKSVTETGVELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEASARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKASNPESKVFYLKMKGDYYRYLAEVATGDARNTVVDDSKNAYQEAFDIAKTKMQPTHPIRLGLALNFSVFYYEILNSPDKACQLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDEAEPQEGGDN
>FBgn0004907;CG17870-PF 14-3-3zeta~14-3-3zeta;14-3-3_protein
MSTVDKEELVQKAKLAEQSERYDDMAQAMKSVTETGVELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEASARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKASNPESKVFYLKMKGDYYRYLAEVATGDARNTVVEDSKKAYQEAFDIAKTKMQPTHPIRLGLALNFSVFYYEIINSPARACHLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDEAEPQEGGDN
>FBgn0020238;CG31196-PA 14-3-3epsilon~14-3-3epsilon;14-3-3_protein
MTERENNVYKAKLAEQAERYDEMVEAMKKVASMDVELTVEERNLLSVAYKNVIGARRASWRIITSIEQKEENKGAEEKLEMIKTYRGQVEKELRDICSDILNVLEKHLIPCATSGESKVFYYKMKGDYHRYLAEFATGSDRKDAAENSLIAYKAASDIAMNDLPPTHPIRLGLALNFSVFYYEILNSPDRACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMQAEEVDPNAGDGEPKEQIQDVEDQDVS
>FBgn0020238;CG31196-PB 14-3-3epsilon~14-3-3epsilon;14-3-3_protein
MTERENNVYKAKLAEQAERYDEMVEAMKKVASMDVELTVEERNLLSVAYKNVIGARRASWRIITSIEQKEENKGAEEKLEMIKTYRGQVEKELRDICSDILNVLEKHLIPCATSGESKVFYYKMKGDYHRYLAEFATGSDRKDAAENSLIAYKAASDIAMNDLPPTHPIRLGLALNFSVFYYEILNSPDRACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMQAEVDPNAGDGEPKEQIQDVEDQDVS

input_4:reference protein files for diamond

input_4/Dm-r6.49.fasta

>FBgn0004907;CG17870-PD 14-3-3zeta~14-3-3zeta;14-3-3_protein
MSTVDKEELVQKAKLAEQSERYDDMAQAMKSVTETGVELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEASARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKASNPESKVFYLKMKGDYYRYLAEVATGDARNTVVDDSQTAYQDAFDISKGKMQPTHPIRLGLALNFSVFYYEILNSPDKACQLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDEAEPQEGGDN
>FBgn0004907;CG17870-PB 14-3-3zeta~14-3-3zeta;14-3-3_protein
MSTVDKEELVQKAKLAEQSERYDDMAQAMKSVTETGVELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEASARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKASNPESKVFYLKMKGDYYRYLAEVATGDARNTVVDDSKNAYQEAFDIAKTKMQPTHPIRLGLALNFSVFYYEILNSPDKACQLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDEAEPQEGGDN
>FBgn0004907;CG17870-PF 14-3-3zeta~14-3-3zeta;14-3-3_protein
MSTVDKEELVQKAKLAEQSERYDDMAQAMKSVTETGVELSNEERNLLSVAYKNVVGARRSSWRVISSIEQKTEASARKQQLAREYRERVEKELREICYEVLGLLDKYLIPKASNPESKVFYLKMKGDYYRYLAEVATGDARNTVVEDSKKAYQEAFDIAKTKMQPTHPIRLGLALNFSVFYYEIINSPARACHLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDTQGDEAEPQEGGDN
>FBgn0020238;CG31196-PA 14-3-3epsilon~14-3-3epsilon;14-3-3_protein
MTERENNVYKAKLAEQAERYDEMVEAMKKVASMDVELTVEERNLLSVAYKNVIGARRASWRIITSIEQKEENKGAEEKLEMIKTYRGQVEKELRDICSDILNVLEKHLIPCATSGESKVFYYKMKGDYHRYLAEFATGSDRKDAAENSLIAYKAASDIAMNDLPPTHPIRLGLALNFSVFYYEILNSPDRACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMQAEEVDPNAGDGEPKEQIQDVEDQDVS
>FBgn0020238;CG31196-PB 14-3-3epsilon~14-3-3epsilon;14-3-3_protein
MTERENNVYKAKLAEQAERYDEMVEAMKKVASMDVELTVEERNLLSVAYKNVIGARRASWRIITSIEQKEENKGAEEKLEMIKTYRGQVEKELRDICSDILNVLEKHLIPCATSGESKVFYYKMKGDYHRYLAEFATGSDRKDAAENSLIAYKAASDIAMNDLPPTHPIRLGLALNFSVFYYEILNSPDRACRLAKAAFDDAIAELDTLSEESYKDSTLIMQLLRDNLTLWTSDMQAEVDPNAGDGEPKEQIQDVEDQDVS

Command

annotation~blast-diamond_for_protein -c 8 -m 32 -x input_2/ -y input_3/ -z input_4/ input_1/test.fa

Output

result.txt

qseqid	length	blastp.Dm-r6.49.fasta:sseqid	qlen	slen	pident	length	mismatch	gapopen	qstart	qend	sstart	send	sframe	evalue	bitscore	stitle	diamond.Dm-r6.49.fasta:sseqid	qlen	slen	pident	length	mismatch	gapopen	qstart	qend	sstart	send	sframe	evalue	bitscore	stitle	length														
FBgn0004907;CG17870-PB	248	FBgn0004907;CG17870-PB	248	248	100.000	248	0	0	1	248	1	248	1	0.0	509	FBgn0004907;CG17870-PB 14-3-3zeta~14-3-3zeta;14-3-3_protein	FBgn0004907;CG17870-PB	248	248	100	248	0	0	1	248	1	248	0	6.45e-171	471	FBgn0004907;CG17870-PB 14-3-3zeta~14-3-3zeta;14-3-3_protein	248														
FBgn0004907;CG17870-PD	248	FBgn0004907;CG17870-PD	248	248	100.000	248	0	0	1	248	1	248	1	0.0	510	FBgn0004907;CG17870-PD 14-3-3zeta~14-3-3zeta;14-3-3_protein	FBgn0004907;CG17870-PD	248	248	100	248	0	0	1	248	1	248	0	9.17e-171	470	FBgn0004907;CG17870-PD 14-3-3zeta~14-3-3zeta;14-3-3_protein	248														
FBgn0004907;CG17870-PF	248	FBgn0004907;CG17870-PF	248	248	100.000	248	0	0	1	248	1	248	1	0.0	511	FBgn0004907;CG17870-PF 14-3-3zeta~14-3-3zeta;14-3-3_protein	FBgn0004907;CG17870-PF	248	248	100	248	0	0	1	248	1	248	0	3.20e-171	471	FBgn0004907;CG17870-PF 14-3-3zeta~14-3-3zeta;14-3-3_protein	248														
FBgn0020238;CG31196-PA	262	FBgn0020238;CG31196-PA	262	262	100.000	262	0	0	1	262	1	262	1	0.0	539	FBgn0020238;CG31196-PA 14-3-3epsilon~14-3-3epsilon;14-3-3_protein	FBgn0020238;CG31196-PA	262	262	100	262	0	0	1	262	1	262	0	1.21e-182	501	FBgn0020238;CG31196-PA 14-3-3epsilon~14-3-3epsilon;14-3-3_protein	262														
FBgn0020238;CG31196-PB	261	FBgn0020238;CG31196-PB	261	261	100.000	261	0	0	1	261	1	261	1	0.0	538	FBgn0020238;CG31196-PB 14-3-3epsilon~14-3-3epsilon;14-3-3_protein	FBgn0020238;CG31196-PB	261	261	100	261	0	0	1	261	1	261	0	4.56e-182	500	FBgn0020238;CG31196-PB 14-3-3epsilon~14-3-3epsilon;14-3-3_protein	261														

view all outputs

Log