CO1-haplotype frequencies of long-spined sea urchin (Diadema setosum) from the Indo-Malay archipelago (doi:10.23708/ZWQEFN)

Document Description

Citation

Title:

CO1-haplotype frequencies of long-spined sea urchin (Diadema setosum) from the Indo-Malay archipelago

Identification Number:

doi:10.23708/ZWQEFN

Distributor:

DataSuds

Date of Distribution:

2022-10-05

Version:

1

Bibliographic Citation:

Vimono, Indra B.; Borsa, Philippe; Pouyaud, Laurent, 2022, "CO1-haplotype frequencies of long-spined sea urchin (Diadema setosum) from the Indo-Malay archipelago", https://doi.org/10.23708/ZWQEFN, DataSuds, V1, UNF:6:Gq/xRLgto2IIMOuZ+Jjuwg== [fileUNF]

Study Description

Citation

Title:

CO1-haplotype frequencies of long-spined sea urchin (Diadema setosum) from the Indo-Malay archipelago

Subtitle:

Alignment of long-spined sea urchin Diadema setosum nucleotide sequences from the Indo-Malay archipelago, sampling details and haplotype frequencies by sample

Identification Number:

doi:10.23708/ZWQEFN

Authoring Entity:

Vimono, Indra B. (Badan Riset dan Inovasi Nasional - Indonesia)

Borsa, Philippe (UMR Entropie - IRD, Univ. La Réunion, CNRS, Ifremer, UNC - France)

Pouyaud, Laurent (UMR ISEM - University of Montpellier, CNRS, IRD, EPHE, CIRAD, INRAP - France)

Other identifications and acknowledgements:

Hocdé, Régis (UMR Marbec)

Date of Production:

2022-09-23

Software used in Production:

Chromas

Software used in Production:

MegaX

Distributor:

DataSuds

Access Authority:

Borsa, Philippe

Depositor:

Borsa, Philippe

Date of Deposit:

2022-09-29

Holdings Information:

https://doi.org/10.23708/ZWQEFN

Study Scope

Keywords:

Medicine, Health and Life Sciences, mitochondrial DNA, cytochrome-oxidase subunit 1 gene, nucleotide sequence, phylogeography, Coral Triangle, Indonesian seas

Topic Classification:

Continental waters and oceans: generalities, Zoology

Abstract:

The dataset contains three files, in text (.txt) or tab-separated values (.tsv) format, that together characterize haplotype composition at the COI locus in the long-spined sea urchin Diadema setosum (https://www.marinespecies.org/aphia.php?p=taxdetails&id=213372) from the Indo-Malay archipelago. The dataset was produced in the course of a phylogeographic study of this tropical Indo-Pacific species, itself part of Indra B. Vimono’s PhD project on the comparative phylogeography of Indo-Pacific sea urchins of the genera Diadema and Echinometra.

METHODS

Briefly, long-spined sea urchins were sampled throughout the Indo-Malay archipelago between July 2019 and November 2021. Gonad tissue was dissected and preserved in ethanol. Genomic DNA was extracted, and a 1157-nucleotide-long segment beginning at the 5’ end of the mitochondrial cytochrome oxidase subunit I (COI) gene was amplified by polymerase chain reaction according to the protocols of Ivanova & Grainger (2007). Amplicons were sequenced by the Sanger method. Sequence chromatograms were verified in Chromas v. 2.6.5 (Technelysium, Brisbane, Australia). All nucleotide sequences were deposited in GenBank (Clark et al. 2016) and allocated accession numbers OP310072 to OP310789. Nucleotide sequences were aligned using the ClustalW algorithm implemented in MegaX v. 10.0.4 (Kumar et al. 2018).

DESCRIPTION OF DATASET

1. FILE1 is a table in .tsv format containing the sampling details of Diadema setosum in the Indo-Malay archipelago. The columns are as follows:

  • “Sequence_ID”: a unique identifier for each nucleotide sequence (see legend to File 2);
  • “Organism”: here, Diadema setosum;
  • “Isolate”: unique identifier given to the DNA extract, identical to the sequence identifier;
  • “Location”: enclosed in inverted commas; includes country, oceanic region, and precise name of sampling site;
  • “lat_lon”: latitude, followed by longitude, both in decimal degrees;
  • “Sampling_date”: three-letter or four-letter abbreviation for month, followed by year;
  • “GenBank_no”: GenBank accession number (two letters immediately followed by six digits).

2. FILE2 is in FASTA (.txt) format. It contains the alignment of nucleotide sequences of 718 Diadema setosum individuals from the Indo-Malay archipelago, over a 1157-nucleotide-long portion of the CO1 gene. Sequence labels, each preceded by a greater-than sign (“>”) and ended by a line break, were constructed as follows: identifier of individual (e.g., “SBD-1”), followed by abbreviation of sampling site (“Sa1”), followed by haplotype identifier (“H1”). The line that follows each sequence label is the nucleotide sequence under that label.

3. FILE3, in .tsv format, is a table containing haplotype frequencies by sample. The first line contains the column headings; abbreviations for samples are as in File 1. The following 259 lines contain the haplotype counts by sample. Haplotypes are identified by the same identifiers (“Haplotype_ID”) as those used for individual labels in File 1. The last line of the table contains the column sums, equal to sample sizes. The last column contains the line sums, equal to haplotype frequencies in the total sample.

REFERENCES

  • Clark, K., Karsch-Mizrachi, I., Lipman, D. J., Ostell, J., Sayers, E. W. (2016). GenBank. Nucleic Acids Research 44, D67–D72. https://doi.org/10.1093/nar/gkv1276
  • Ivanova, N., Grainger, C. (2007). COI amplification: Taq polymerase choice. CCDB Protocols (Can Ctr DNA Barcoding, Guelph). http://www.dnabarcoding.ca
  • Kumar, S., Stecher, G., Li, M., Knyaz, C., Tamura, K. (2018). MEGA X: Molecular evolutionary genetics analysis across computing platforms. Molecular Biology and Evolution 35, 1547–1549. https://doi.org/10.1093/molbev/msy096
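The accession range quoted in the abstract can be checked against the stated number of individuals. The sketch below assumes that accession numbers were allocated consecutively within the range, which the abstract suggests but does not state outright; the function name is illustrative and not part of the dataset.

```python
import re

# Accession pattern described for the "GenBank_no" column:
# two letters immediately followed by six digits.
ACCESSION = re.compile(r"^([A-Z]{2})(\d{6})$")


def accession_range_size(first: str, last: str) -> int:
    """Count accessions in an inclusive range sharing one two-letter prefix."""
    m_first, m_last = ACCESSION.match(first), ACCESSION.match(last)
    if not (m_first and m_last):
        raise ValueError("accession is not two letters followed by six digits")
    if m_first.group(1) != m_last.group(1):
        raise ValueError("accessions have different prefixes")
    return int(m_last.group(2)) - int(m_first.group(2)) + 1


# Range quoted in the abstract.
print(accession_range_size("OP310072", "OP310789"))  # 718
```

Under that assumption the range holds 718 accessions, which matches the 718 individuals in the alignment and the 718 cases reported for File 1.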

Date of Collection:

2019-08-01 to 2021-11-30

Country:

Indonesia

Geographic Unit(s):

Indo-Malay archipelago

Geographic Bounding Box:

  • West Bounding Longitude: 95.254
  • East Bounding Longitude: 140.737
  • South Bounding Latitude: -8.7785
  • North Bounding Latitude: 5.891
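As an illustration of how the bounding box relates to the “lat_lon” column of File 1, the following sketch tests whether a coordinate pair falls inside it. The exact text layout of “lat_lon” values is not specified in this record, so the parser assumes two signed decimal numbers separated by whitespace and/or a comma; the example coordinates are made up.

```python
# Bounding box values copied from the study description.
WEST, EAST = 95.254, 140.737
SOUTH, NORTH = -8.7785, 5.891


def parse_lat_lon(value: str) -> tuple[float, float]:
    """Parse latitude then longitude from a string of two signed decimals.

    Assumption: the numbers are separated by whitespace and/or a comma.
    """
    parts = value.replace(",", " ").split()
    if len(parts) != 2:
        raise ValueError(f"expected two numbers, got {value!r}")
    return float(parts[0]), float(parts[1])


def in_bounding_box(lat: float, lon: float) -> bool:
    """True if the point lies within the declared geographic bounding box."""
    return SOUTH <= lat <= NORTH and WEST <= lon <= EAST


lat, lon = parse_lat_lon("-5.05 119.33")  # made-up coordinates, for illustration
print(in_bounding_box(lat, lon))  # True
```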

Notes:

Data type: Analysis data

Methodology and Processing

Sources Statement

Data Access

Restrictions:

Data files are under embargo, which will be lifted when the related article is submitted for publication.

Citation Requirement:

Please cite the dataset as indicated in the dataset main page.

Other Study Description Materials

Related Publications

Citation

Title:

Vimono I., Borsa P., Hocdé R., Pouyaud L. Phylogeography of the long-spined sea urchin Diadema setosum across the Indo-Malay archipelago. (in prep.)

Bibliographic Citation:

Vimono I., Borsa P., Hocdé R., Pouyaud L. Phylogeography of the long-spined sea urchin Diadema setosum across the Indo-Malay archipelago. (in prep.)

File Description--f37731

File: Diadema_setosum_FILE1_sampling_details_GenBank_nos.tab

  • Number of cases: 718

  • No. of variables per record: 7

  • Type of File: text/tab-separated-values

Notes:

UNF:6:4bvZDYJ1M3O5NuwHcnBSKQ==

Geospatial
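A minimal sketch of how this file might be read and sanity-checked with the Python standard library. The column names come from the abstract; the one-row sample table is made up for illustration, and the helper name is hypothetical.

```python
import csv
import io
import re

# Column names as listed in the abstract; compared as a set,
# so the column order in the real file does not matter here.
EXPECTED_COLUMNS = {"Sequence_ID", "Organism", "Isolate", "Location",
                    "lat_lon", "Sampling_date", "GenBank_no"}
ACCESSION = re.compile(r"^[A-Z]{2}\d{6}$")


def check_sampling_table(handle) -> int:
    """Validate header and accession format; return the number of data rows."""
    reader = csv.DictReader(handle, delimiter="\t")
    if set(reader.fieldnames or []) != EXPECTED_COLUMNS:
        raise ValueError(f"unexpected header: {reader.fieldnames}")
    n_rows = 0
    for row in reader:
        if not ACCESSION.match(row["GenBank_no"]):
            raise ValueError(f"bad accession: {row['GenBank_no']!r}")
        n_rows += 1
    return n_rows


# Made-up one-row table, for illustration only (values are not from the dataset).
header = ["Sequence_ID", "Organism", "Isolate", "Location",
          "lat_lon", "Sampling_date", "GenBank_no"]
row = ["SBD-1", "Diadema setosum", "SBD-1", '"Indonesia: example site"',
       "-5.05 119.33", "Aug 2019", "OP310072"]
sample = "\t".join(header) + "\n" + "\t".join(row) + "\n"
print(check_sampling_table(io.StringIO(sample)))  # 1
```

To run the check on the actual file, pass a handle opened with `open(path, newline="", encoding="utf-8")`; the returned row count would be expected to equal the 718 cases listed above.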

File Description--f37707

File: Diadema_setosum_FILE3_haplotype_frequencies.tab

  • Number of cases: 260

  • No. of variables per record: 22

  • Type of File: text/tab-separated-values

Notes:

UNF:6:XGdxjl5mQy7T8qBASxP5vA==

Genomics
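The abstract states that the last line of this table holds the column sums and the last column holds the line sums. A sketch of that consistency check follows, using a small made-up table rather than the real counts; the function name is illustrative.

```python
def margins_consistent(table: list[list[int]]) -> bool:
    """Check a count table whose last row and last column hold the sums.

    `table` holds only the numeric cells: one row per haplotype,
    followed by a final row of column totals.
    """
    *body, totals = table
    # Each haplotype row: last cell equals the sum of the per-sample counts.
    rows_ok = all(row[-1] == sum(row[:-1]) for row in body)
    # Totals row: each cell equals the sum of the column above it.
    cols_ok = all(totals[j] == sum(row[j] for row in body)
                  for j in range(len(totals)))
    return rows_ok and cols_ok


# Made-up table: two haplotypes, two samples, plus margins.
toy = [
    [3, 1, 4],  # haplotype A: counts in two samples, then line sum
    [0, 2, 2],  # haplotype B
    [3, 3, 6],  # column sums (sample sizes and grand total)
]
print(margins_consistent(toy))  # True
```

With 260 cases reported for this file, the real table would supply 259 haplotype rows followed by the totals row.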

Other Study-Related Materials

Label:

Diadema_setosum_dataset_contents.txt

Notes:

text/plain

Other Study-Related Materials

Label:

Diadema_setosum_FILE2_COI_alignment.txt

Text:

Alignment of nucleotide sequences of 718 Diadema setosum individuals (FASTA text format) from the Indo-Malay archipelago, over a 1157-nucleotide-long portion of the CO1 gene.

Notes:

text/plain
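The label structure described for the alignment file (individual, sampling site, haplotype) can be split with a few lines of Python. The separator between the three label parts is not given in this record; the sketch assumes an underscore, and the toy sequence is made up and far shorter than the real 1157 nucleotides.

```python
from typing import Iterable, Iterator, NamedTuple


class Record(NamedTuple):
    individual: str
    site: str
    haplotype: str
    sequence: str


def read_alignment(lines: Iterable[str], sep: str = "_",
                   expected_length: int = 1157) -> Iterator[Record]:
    """Yield one record per label/sequence pair of a two-line-per-entry FASTA.

    Assumption: label parts are joined by `sep` (underscore by default).
    """
    label = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            label = line[1:]
            continue
        if label is None:
            raise ValueError("sequence line found before any label")
        # Split from the right so the individual identifier may contain `sep`.
        individual, site, haplotype = label.rsplit(sep, 2)
        if len(line) != expected_length:
            raise ValueError(f"{label}: length {len(line)} != {expected_length}")
        yield Record(individual, site, haplotype, line)
        label = None


# Toy input using the example label parts from the abstract.
toy_lines = [">SBD-1_Sa1_H1", "ACGTACGT"]
records = list(read_alignment(toy_lines, expected_length=8))
print(records[0].individual, records[0].site, records[0].haplotype)
```

Tallying `(site, haplotype)` pairs over all records, for example with `collections.Counter`, should in principle reproduce the counts tabulated in File 3.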