Genome-wide mRNA and miRNA expression data analysis to screen for markers involved in sarcomagenesis in human chondrosarcoma cell lines

Genome-wide mRNA and miRNA expression data analysis to screen for markers involved in sarcomagenesis in human chondrosarcoma cell lines

GDATA-00098; No of Pages 5 Genomics Data xxx (2014) xxx–xxx Contents lists available at ScienceDirect Genomics Data journal homepage: http://www.jou...

1MB Sizes 0 Downloads 47 Views

GDATA-00098; No of Pages 5 Genomics Data xxx (2014) xxx–xxx

Contents lists available at ScienceDirect

Genomics Data journal homepage: http://www.journals.elsevier.com/genomics-data/

1

Data in Brief

4Q1

Biju Issac a,⁎, Karina Galoian b, Toumy Guettouche c, Loida Navarro d, H. Thomas Temple b,e

O

F

3

Genome-wide mRNA and miRNA expression data analysis to screen for markers involved in sarcomagenesis in human chondrosarcoma cell lines

5 6 7 8 9Q2

a

1 0

a r t i c l e

11 12 13 14 15

Article history: Received 3 September 2014 Received in revised form 30 September 2014 Accepted 1 October 2014 Available online xxxx

16 17 18 19 20 21

Keywords: Human chrondrosarcoma mRNA expression miRNAs Microarray Bioinformatics

Biostatistics and Bioinformatics Core/Division of Bioinformatics, Sylvester comprehensive Cancer Center, University of Miami, 1550 NW 10th Ave, Fox 304A, Miami, FL 33136, USA Department of Orthopaedic Surgery, Miller School of Medicine, University of Miami, 1600 NW 10th Ave, S 8012, Miami, FL 33136, USA Children's Hospital of Philadelphia, Center of Applied Genomics, 3615 Civic center Blvd, Philadelphia, PA 19104-4318, USA d Oncogenomics Core, Sylvester Comprehensive Cancer Center, University of Miami, 1501 NW 10th Ave, BRB 542 B.02, Miami FL 33136, USA e Department of Orthopaedic Surgery and University of Miami Tissue Bank Division, Miller School of Medicine, University of Miami, 1400 NW 12th Ave, UMH, East Bldg 4036, Miami, FL 33136, USA b

a b s t r a c t

P

i n f o

Genes and miRNAs involved in sarcomagenesis related pathways are unknown and therefore signaling events leading to mesenchymal cell transformation to sarcoma are poorly elucidated. Exiqon and Illumina microarray study on human chondrosarcoma JJ012 and chondrocytes C28 cell lines to compare and analyze the differentially expressed miRNAs and their gene targets was recently published in the Journal Tumor Biology in 2014. Here we describe in details the contents and quality controls for the miRNA and gene expression data associated with the study that is relevant to this dataset. © 2014 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

E

T E

32

Organism/cell line/tissue

t1:4 t1:5

Sex Sequencer or array type

t1:6 t1:7 t1:8

Data format Experimental factors Experimental features

t1:9 t1:10

Consent Sample source location

Human Chondrosarcoma JJ012 and Chondrocytes C28 Cell lines Not applicable Exiqon miRNA array and Illumina Human HT-12 Gene expression array Raw Tumor vs normal Whole genome coverage transcriptional coverage of characterized and uncharacterized RNAs using arrays Not applicable Miami, Florida, USA

R

t1:3

N C O

Specifications

U

t1:2

R

t1:1

35

Direct link to deposited data

36

Deposited data can be found here: http://www.ncbi.nlm.nih.gov/ geo/query/acc.cgi?acc=GSE48420.

37

22 23 24 25 26 27 28 29

C

33 31 30

34

R O

c

D

2

Abbreviations: SD, standard deviation; PCA, principal component analysis. ⁎ Corresponding author. Tel.: +1 305 243 6741; fax: +1 305 243 9342. E-mail address: [email protected] (B. Issac).

Experimental design, materials and methods

38

Study cell line data

39

Human chondrosarcoma (JJ012) and chrondrocytes (C28) cell lines were obtained from the University of Miami Tissue bank and cultured in monolayer until they were semiconfluent as described previously [1,2]. Cells were lysed and 100 μg of total RNA from the sample was prepared using the Qiagen RNeasy mini kit. NanoDrop 8000 spectrometer was used to check for RNA integrity and the presence of small RNA fractions was determined using Agilent Bioanalyzer 2100.

40

miRNA expression data

47

20 ng of total RNA per panel was reverse transcribed using the miRCURY LNA™ Universal RT microRNA PCR and polyadenylation and cDNA synthesis kit (Exiqon, Woburn, MA). For each sample, the cDNA was diluted, and each reaction was combined with SYBR Green Master Mix (Exiqon) and added to the Ready-to-Use PCR panels. Human microRNA Ready-to-Use PCR panels I and II hold 743 different miRNA targets and six reference gene assays (mirBase13). They were tested on a Roche LightCycler 480 realtime PCR instrument.

48

http://dx.doi.org/10.1016/j.gdata.2014.10.001 2213-5960/© 2014 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Please cite this article as: B. Issac, et al., Genome-wide mRNA and miRNA expression data analysis to screen for markers involved in sarcomagenesis in human chondrosarcoma cell lines, Genomics Data (2014), http://dx.doi.org/10.1016/j.gdata.2014.10.001

41 42 43 44 45 46

49 50 51 52 53 54 55 56

Normalization

76

81

As recommended by the software user guide for screening studies, variability arising from differences in sample handling was minimized using global mean normalization with mean expression of all genes. Markers with Cq larger than 34 were not included as directed. Replicates were averaged and values were converted to relative quantities on the linear scale prior to log2 conversion.

82

Basic analysis

83

The processed miRNA data was analyzed using hierarchical clustering (Fig. 1) and principal component analysis (Fig. 2) methods to visually

84

The mRNA expression data were loaded on GeneSpring® 12.5 GX (Agilent Technologies, Santa Clara, CA, USA). The data for all probes including those with missing values were included in the preprocessing steps. Quality assessments were done visually using various plots including PCA (Fig. 4). All samples were found to be of high quality with intra-group correlation of expression data within 98– 99% match.

87 88 89 90

93 94 95

97 98 99 100 101 102 103

Normalization

104

Normalization of the mRNA expression data was performed using the ‘Quantile’ normalization method available in GeneSpring and the data was log2 transformed to the mean of all samples before statistical analyses.

105 106

C

79 80

96

Quality control

E

77 78

92

R

71 72

Expression data was obtained by hybridizing extracted RNA from samples to HumanHT-12 V4_0_R1 array platform from Illumina. Care was taken to obtain RNA aliquots of same harvested cells as with miRNA expression data.

R

69 70

91

O

67 68

Gene expression data

C

65 66

N

63 64

U

62

F

75

60 61

R O

73 74

We used the GenEx version 5 software from MultiD Analyses AB to analyze the RT-PCR data of the human miRNAs. Layout files for human panels I and II version 2 with miRBase 16 annotations was loaded on the software. miRNA expression data was imported and sample and group classification columns were appropriately modified to define the replicates. The groups were labeled as ‘SARC’ for human chondrosarcoma and ‘C28’ for chondrocytes. The data was pre-processed as provided by the software guidelines. Interplate calibration was performed using mean values between plates. Outlier data points were deleted from analysis using Grubb's test on replicates with default confidence level (0.95) and cut-off SD (0.25). All values which were larger than 37 (default) were replaced with a blank and all empty rows and undetected miRNAs were removed. Missing ‘NaN’ values were also replaced and the data was validated to remove data points for miRNAs which had less than 75% values from replicates within a group. All remaining missing values were now imputed based on groups and empty cells were filled with a value of 38 as recommended.

85 86

P

58 59

interpret the differential expression between groups. The groups were noted to have sufficient differences for a more robust statistical significance analysis. The C28 and SARC were compared using a 2-tailed unpaired T-test and the resulting P-values were corrected for false discovery using the well documented Benjamini–Hochberg method (Fig. 3).

D

Quality control

T

57

O

B. Issac et al. / Genomics Data xxx (2014) xxx–xxx

E

2

Fig. 1. Hierarchical clustering of miRNA expression data from human chondrosarcomas JJ012 (SARC) and human chondrocytes (C28) cell lines.

Please cite this article as: B. Issac, et al., Genome-wide mRNA and miRNA expression data analysis to screen for markers involved in sarcomagenesis in human chondrosarcoma cell lines, Genomics Data (2014), http://dx.doi.org/10.1016/j.gdata.2014.10.001

107 108

3

P

R O

O

F

B. Issac et al. / Genomics Data xxx (2014) xxx–xxx

110

Since one objective of the study was to cross-compare significant miRNA expression within groups to their target mRNA expression, statistical significance test was performed between human chondrosarcoma (SARC) and chrondrocytes (C28) as with miRNA expression data.

C E R R N C O

113

U

111 112

Multiple testing corrections on nominal P-values were performed using Benjamini–Hochberg method (Fig. 5). Predicted and validated gene targets for miRNAs were obtained from miRWalk database (http://www. umm.uni-heidelberg.de/apps/zmf/mirwalk/index.html). Gene expression data for targets of significant miRNAs were matched from the differential mRNA expression obtained above.

E

Basic analysis

T

109

D

Fig. 2. Principal component analysis (PCA) plot on miRNA expression data from human chondrosarcoma (SARC) and chondrocytes (C28) using the first two components.

Fig. 3. miRNAs with statistical significance after a T-test are shown in green on a volcano plot above. Those with marginal significance are shown in yellow, while those with low or no significance are shown in red and gray respectively.

Please cite this article as: B. Issac, et al., Genome-wide mRNA and miRNA expression data analysis to screen for markers involved in sarcomagenesis in human chondrosarcoma cell lines, Genomics Data (2014), http://dx.doi.org/10.1016/j.gdata.2014.10.001

114 115 116 117 118 119

B. Issac et al. / Genomics Data xxx (2014) xxx–xxx

E

D

P

R O

O

F

4

T

Fig. 4. Principal component analysis (PCA) plot using first 3 components on mRNA expression data from human chondrosarcoma (SARC) in blue and human chondrocytes (C28) in red.

Discussion

121 122

We describe here a unique miRNA and mRNA expression dataset from human chondrosarcoma (JJ012) and chondrocytes (C28) cell lines. This dataset comprises only array data from Exiqon

E

N

C

O

R

R

miRNA array and Illumina mRNA array. The expression data submitted here is of highest quality and has been used in studies published recently [3] in journals with wide subscription and impact, therefore would benefit wider biomedical research through further sharing.

U

123

C

120

Fig. 5. Volcano plot showing significantly (fold change N 1.5 and P-values b 0.05) differentially expressed genes in red after an unsupervised T-test on human chondrosarcoma (SARC) and chondrocytes (C28).

Please cite this article as: B. Issac, et al., Genome-wide mRNA and miRNA expression data analysis to screen for markers involved in sarcomagenesis in human chondrosarcoma cell lines, Genomics Data (2014), http://dx.doi.org/10.1016/j.gdata.2014.10.001

124 125 126 127 128

B. Issac et al. / Genomics Data xxx (2014) xxx–xxx

129 130

Conflict of interests None. Acknowledgments

132

The study was financially supported by Sylvester Comprehensive Cancer Center (SCCC) and University of Miami Tissue Bank.

134

[1] K.A. Galoian, H.T. Temple, A.A. Galoyan, mTORC1 inhibition and ECM–cell adhesionindependent drug resistance via PI3K–AKT and PI3K–RAS–MAPK feedback loops. Tumour Biol. 33 (3) (2012) 885–890. [2] K.A. Galoian, N. Garamszegi, S. Garamszegi, S. Scully, Molecular mechanism of tenascin-C action on matrix metalloproteinase-1 invasive potential. Exp. Biol. Med. 232 (4) (2007) 515–522. [3] K.A. Galoian, T. Guettouche, B. Issac, L. Navarro, Lost miRNA surveillance of Notch, IGFR pathway—road to sarcomagenesis. Tumor Biol. 35 (2014) 483–492.

135 136 137 138 139 140 141 142

O R O P D E T C E R R N C O U

133 143

References

F

131

5

Please cite this article as: B. Issac, et al., Genome-wide mRNA and miRNA expression data analysis to screen for markers involved in sarcomagenesis in human chondrosarcoma cell lines, Genomics Data (2014), http://dx.doi.org/10.1016/j.gdata.2014.10.001