[[山内のサイト]]
**Processing the 菅野 list [#y59cb6c2]
-Open the link given in the RefSeq or INSDC column of the 菅野 list.
-If it leads to a Genome page (e.g. CP029494.1), set Display so that the full sequence is included, then use SendTo to download to a file. Name the file CP029494.gb.
-If it leads to a Contig page (e.g. NZ_NHMK00000000.1), expand the WGS link "NHMK01000001-NHMK01000040" near the bottom of the page, choose Download from the tabs, and open it. Download the GenBank gbff.gz file listed there, "GenBank:NHMK01.1.gbff.gz". This file holds the gb-format records of the individual contigs, concatenated one after another and separated by // lines.
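Because a gbff.gz download holds many gb-format records separated by // lines, it can help to check how many contigs a file contains before further processing. The following is a minimal sketch using only the Python standard library; the function names and the in-memory demo file are assumptions made for illustration, not part of the workflow described on this page.

```python
import gzip
import io


def iter_genbank_records(handle):
    """Yield each GenBank record as a string; a record ends at a '//' line."""
    lines = []
    for line in handle:
        lines.append(line)
        if line.strip() == "//":
            yield "".join(lines)
            lines = []


def record_names(handle):
    """Return the name found on the LOCUS line of every record."""
    names = []
    for record in iter_genbank_records(handle):
        first_line = record.lstrip().splitlines()[0]
        names.append(first_line.split()[1])
    return names


# A tiny two-contig file built in memory and gzip-compressed, standing in
# for a real download such as NHMK01.1.gbff.gz (the contents are invented).
demo_text = (
    "LOCUS       CONTIG_A   12 bp    DNA     linear   BCT\n"
    "ORIGIN\n"
    "        1 atgaaaccgt aa\n"
    "//\n"
    "LOCUS       CONTIG_B   9 bp    DNA     linear   BCT\n"
    "ORIGIN\n"
    "        1 atgccctga\n"
    "//\n"
)
demo_gz = gzip.compress(demo_text.encode("ascii"))

# gzip.open in text mode reads the compressed data without unpacking to disk.
with gzip.open(io.BytesIO(demo_gz), "rt") as handle:
    names = record_names(handle)
print(names)
```

For a real file, pass the path to gzip.open instead of the BytesIO object.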
***Comments from 岸本先生, 2019-10-18 [#w5baed3d]
>As a first step, I think it would be good to restrict the analysis to the organisms in 菅野-san's list
>whose TemperatureRange is recorded as Thermophilic, Hyperthermophilic, or Mesophilic.
Entries that carry a value here are ones whose classification is properly documented, on the basis of growth temperature, in a database or a paper, so I think the data will be more reliable in a comparative analysis.
This selection gives 51 organisms.
>Of these, 38 have an Optimal temp (℃) value in addition to TemperatureRange.
>Finally, I have highlighted in yellow the cells of the 18 organisms that I selected.
The reason is that I focused on groups in which a species (or its close relatives) has both mesophilic and thermophilic members, and which therefore contain bacteria that have likely re-evolved toward high temperature in nature within the species.
I also added very well-known thermophiles to make up the list.
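The first two selection steps in the comment above could be sketched as follows. This is only an illustration under assumptions: the list is taken to be exported as CSV, the column is named "Optimal temp" without its unit suffix, and every row in the demo data is invented.

```python
import csv
import io

# Invented rows standing in for a CSV export of the list. Only the column
# names TemperatureRange and "Optimal temp" come from the comment above.
demo_csv = """\
Organism,TemperatureRange,Optimal temp
Organism A,Thermophilic,60
Organism B,Mesophilic,
Organism C,,37
Organism D,Hyperthermophilic,85
"""

ACCEPTED_RANGES = {"thermophilic", "hyperthermophilic", "mesophilic"}


def has_range(row):
    """True when TemperatureRange holds one of the three accepted labels."""
    return row["TemperatureRange"].strip().lower() in ACCEPTED_RANGES


def has_optimum(row):
    """True when an optimal temperature is recorded."""
    return bool(row["Optimal temp"].strip())


rows = list(csv.DictReader(io.StringIO(demo_csv)))
with_range = [r for r in rows if has_range(r)]       # first selection
with_both = [r for r in with_range if has_optimum(r)]  # second selection

print([r["Organism"] for r in with_range])
print([r["Organism"] for r in with_both])
```

The final choice of 18 organisms was made by hand, so it is not something this kind of filter can reproduce.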
GenBank files were therefore downloaded for these 19 organisms.
Of these, the following four were excluded because their GB files carry no CDS features:
|Organism|Optimal temp (℃)|TemperatureRange|Accession|h
|Parageobacillus toebii|60|Thermophilic|BDAQ00000000.1|
|Geobacillus jurassicus|60-65|Thermophilic|BCQG00000000.1|
|Thermotoga profunda|65|Thermophilic|AP014510.1|
|Thermotoga caldifontis|75|Thermophilic|AP014509.1|
In addition, the following entries are offered for download as GZ-compressed GBFF files rather than as GB files:
 BAWO01.1.gbff BCQG01.1.gbff BDAQ01.1.gbff JPYA01.1.gbff
These are decompressed and then processed in GBFF format as-is. (Whether the processing is compatible is tested in the next section.)
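Whether a downloaded record carries CDS features can be checked before it enters the analysis. The sketch below scans the feature table of a single GenBank record as plain text, relying on the flat-file layout in which a feature key starts in column 6 and its location in column 22. The helper names and both demo records are assumptions for illustration.

```python
def count_cds_features(record_text):
    """Count the CDS entries in the FEATURES table of one GenBank record."""
    in_features = False
    count = 0
    for line in record_text.splitlines():
        if line.startswith("FEATURES"):
            in_features = True
        elif in_features and line[:1].strip():
            # A new top-level keyword (ORIGIN, CONTIG, //) ends the table.
            in_features = False
        elif in_features and line[5:21].split() == ["CDS"]:
            count += 1
    return count


def feature_line(key, location):
    """Build a feature line: 5 spaces, key padded to 16 columns, location."""
    return "     " + key.ljust(16) + location + "\n"


# Two invented records: one annotated with a CDS, one without.
annotated = (
    "LOCUS       DEMO_A   12 bp    DNA     linear   BCT\n"
    "FEATURES             Location/Qualifiers\n"
    + feature_line("source", "1..12")
    + feature_line("gene", "1..12")
    + feature_line("CDS", "1..12")
    + " " * 21 + '/product="hypothetical protein"\n'
    "ORIGIN\n"
    "        1 atgaaaccgt aa\n"
    "//\n"
)
unannotated = (
    "LOCUS       DEMO_B   9 bp    DNA     linear   BCT\n"
    "FEATURES             Location/Qualifiers\n"
    + feature_line("source", "1..9")
    + "ORIGIN\n"
    "        1 atgccctga\n"
    "//\n"
)

print(count_cds_features(annotated), count_cds_features(unannotated))
```

A file whose records all return 0 would be a candidate for exclusion on the grounds given above.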
***Can biopython's genbank input parse gbff files? [#a2f27fd6]
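 import pandas as pd  # the result of ReadCDS is used as a DataFrame (head/tail)
 from ReadCDSwithGene import ReadCDS  # helper module, not shown on this page
 def main():
     # Multi-record GBFF file, decompressed from the .gbff.gz download
     gbfile = 'heat/BFAG01.1.gbff'
     CDS = ReadCDS(gbfile)
     # Show the first 5 and the last 10 CDS rows as a quick sanity check
     print(CDS.head())
     print(CDS.tail(10))
 if __name__ == '__main__':
     main()
     print('complete')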
The output is:
     pos   len  strand     locus_tag gene  \
 0   431  1878       1  DAERI_010001
 1  2342   783       1  DAERI_010002
 2  3145  2448      -1  DAERI_010003
 3  5579   480       1  DAERI_010004
 4  6060   738       1  DAERI_010005
                                              product  \
 0                               hypothetical protein
 1  carboxypeptidase regulatory-like domain-contai...
 2  serine/threonine-protein kinase transcriptiona...
 3                               hypothetical protein
 4           transcriptional activator domain protein
                                                  seq  \
 0  (A, T, G, A, A, C, C, G, A, C, C, C, C, T, G, ...
 1  (A, T, G, A, A, C, A, A, G, C, G, T, T, C, C, ...
 2  (A, T, G, G, G, C, G, G, G, T, T, C, A, T, G, ...
 3  (A, T, G, A, A, C, C, C, G, C, C, C, A, T, T, ...
 4  (A, T, G, A, C, G, C, A, G, G, A, C, A, C, G, ...
                                                AAseq
 0  [MNRPLTASTLLLTALLSACTTGGSTPGPTVKTIDLSPATASVAVG...
 1  [MNKRSLLAAALSLLLAGCTTGADGTGRPPTPAPNPAPRPAQAHTM...
 2  [MGGFMVHLGSRGLFVPSDPQLREGALAAHPWFGGGAASPQWGETR...
 3  [MNPPIPAPLRRVTPENTYALRADRFSVLLGGEDTGGRLAVIDLCA...
 4  [MTQDTVTGAASWTVQVLGQAGLRGPDGALRPLERKAAALLAYLAV...
So the GBFF file appears to be readable.
***Actual comparison procedure [#ee054ce3]
***Extracting the top 200 [#q169a2b1]
***Comparing the top 200 with Essential Genes (e.g. Goodall) [#y651defd]
1) Compare the top 200 with EssentialGenes and extract the entries that are in the top 200 but not in EssentialGenes.
2) Among the results of 1), look for those that are common to all thermophiles. (This will probably require filtering by matching product names.)
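Both steps are set operations, so they can be sketched with Python sets: a difference for step 1) and an intersection for step 2). The matching key assumed here is the product name, normalized for case and whitespace. All names and data below are invented placeholders, and how the top 200 are obtained is left open.

```python
def normalize(product):
    """Lower-case a product name and collapse runs of whitespace."""
    return " ".join(product.lower().split())


def non_essential_top(top_products, essential_products):
    """Step 1: products in the top list that are absent from the essential list."""
    essential = {normalize(p) for p in essential_products}
    return {normalize(p) for p in top_products} - essential


def common_to_all(product_sets):
    """Step 2: products present in every organism's set."""
    product_sets = list(product_sets)
    if not product_sets:
        return set()
    return set.intersection(*product_sets)


# Invented placeholder data: a short "top" list per thermophile and one
# essential-gene list.
top_by_organism = {
    "thermophile_1": ["Example protein A", "example protein B", "example protein C"],
    "thermophile_2": ["example  protein A", "example protein C", "example protein D"],
}
essential = ["example protein B", "example protein D", "Example Protein A"]

candidates = {
    organism: non_essential_top(products, essential)
    for organism, products in top_by_organism.items()
}
shared = common_to_all(candidates.values())
print(sorted(shared))
```

Generic product names such as "hypothetical protein" would match across organisms without indicating the same gene, so they would probably need to be dropped before the intersection.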
***Extracting 16S ribosomal RNA for evolutionary comparison [#e092fa00]
From the same list, try extracting the sequence of the 16S ribosomal RNA from among the rRNA features in the GB files.
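One way this extraction might look, as a standard-library sketch rather than the method actually used: find rRNA features whose /product mentions "16S ribosomal RNA", then cut the corresponding span out of the ORIGIN sequence. It assumes simple locations of the form start..end or complement(start..end); joined locations are skipped. The function names and the demo record are invented.

```python
import re

COMPLEMENT = str.maketrans("acgtn", "tgcan")
SIMPLE_LOCATION = re.compile(r"(complement\()?<?(\d+)\.\.>?(\d+)\)?")


def origin_sequence(record_text):
    """Return the lowercase sequence from the ORIGIN block of one record."""
    body = record_text.split("\nORIGIN", 1)[1].split("\n//", 1)[0]
    return "".join(re.findall(r"[a-z]+", body.lower()))


def find_16s_locations(record_text):
    """Return (start, end, is_complement) for each 16S rRNA feature."""
    hits = []
    current = None  # location string of the rRNA feature being read
    in_features = False
    for line in record_text.splitlines():
        if line.startswith("FEATURES"):
            in_features = True
            continue
        if not in_features:
            continue
        if line[:1].strip():  # next top-level keyword, e.g. ORIGIN
            break
        key = line[5:21].strip()
        if key:  # a new feature starts on this line
            current = line[21:].strip() if key == "rRNA" else None
        elif current and "/product=" in line and "16S ribosomal RNA" in line:
            match = SIMPLE_LOCATION.fullmatch(current)
            if match:
                hits.append((int(match.group(2)), int(match.group(3)),
                             bool(match.group(1))))
    return hits


def extract_16s(record_text):
    """Return the 16S rRNA sequences of one record, in reading direction."""
    seq = origin_sequence(record_text)
    found = []
    for start, end, is_complement in find_16s_locations(record_text):
        sub = seq[start - 1:end]  # GenBank coordinates are 1-based, inclusive
        if is_complement:
            sub = sub.translate(COMPLEMENT)[::-1]
        found.append(sub)
    return found


def feature(key, location, product):
    """Build a two-line feature entry with GenBank column positions."""
    return ("     " + key.ljust(16) + location + "\n"
            + " " * 21 + '/product="' + product + '"\n')


# Invented 20 bp record with one 23S and two 16S rRNA features.
demo_record = (
    "LOCUS       DEMO   20 bp    DNA     linear   BCT\n"
    "FEATURES             Location/Qualifiers\n"
    + feature("rRNA", "1..4", "23S ribosomal RNA")
    + feature("rRNA", "5..12", "16S ribosomal RNA")
    + feature("rRNA", "complement(13..18)", "16S ribosomal RNA")
    + "ORIGIN\n"
    "        1 aaccggttac gtacgtaaaa\n"
    "//\n"
)
sequences = extract_16s(demo_record)
print(sequences)
```

For a multi-record gbff file, the same functions would be applied to each //-delimited record in turn.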