Bioconductor Code: affxparser

Browse code

git-bioc: Copied updates from Git master branch to Bioconductor SVN

From: hb <[email protected]>

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@121021 bc3139a8-67e5-0310-9ffc-ced21a209358

H Bengtsson authored on 16/09/2016 22:01:49
Showing 1 changed files

R/writeCdf.private.R

History View file @ 24277d8

@@ -198,7 +198,7 @@
                                        ## Writing each group in turn
                                        # Number of bytes: (18+64)*nbrOfGroups + 14*totalNbrOfCells bytes
                                        groupDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                     -  for(igroup in seq(along.with = unit$groups)) {
                                     +  for(igroup in seq_along(unit$groups)) {
                                          group <- unit$groups[[igroup]]
                                          groupDirection <- groupDirections[group$groupdirection];
                                          groupDirection <- switch(group$groupdirection,
@@ -229,7 +229,7 @@
                                                          ncol = 4)
                                          # Number of bytes: 14*nbrOfCells bytes
                                     -    for(icell in seq(along.with = group$x)) {
                                     +    for(icell in seq_along(group$x)) {
                                            # Number of bytes: 1*4+2*2+1*4+1*2=14 bytes
                                            writeBin(cells[icell, 1],
                                                     con = con, size = 4, endian = "little")
@@ -284,7 +284,7 @@
                                        cells <- matrix(as.integer(c(qcunit$x, qcunit$y, qcunit$length,
                                                                     qcunit$pm, qcunit$background)),
                                                        ncol = 5)
                                     -  for(icell in seq(along.with = qcunit$x)) {
                                     +  for(icell in seq_along(qcunit$x)) {
                                          writeBin(cells[icell, 1:2], con = con, size = 2, endian = "little")
                                          writeBin(cells[icell, 3:5], con = con, size = 1, endian = "little")
+                                       }

Browse code

Version: 1.33.3 [2013-06-29] o Same updates as in release v1.32.3.

...

Version: 1.32.3 [2013-06-29]
o BUG FIX: Since affxparser v1.30.2/1.31.2 (r72352; 2013-01-08),
writeCdf() would incorrectly encode the unit types, iff the input
'cdf' argument specified them as integers, e.g. as done by
writeCdf() for AffyGenePDInfo in aroma.affymetrix. More
specifically, the unit type index would be off by one, e.g. an
'expression' unit (1) would be encoded as an 'unknown' unit (0)
and so on. On the other hand, if they were specified by their
unit-type names (e.g. 'expression') the encoding should still be
correct, e.g. if input is constructed from readCdf() of affxparser.
Thanks to Guido Hooiveld at Wageningen UR (The Netherlands) for
reporting on this.
o BUG FIX: Similarily, writeCdf() has "always", at least affxparser
v1.7.4 since (r21888; 2007-01-09), encoded unit directions and
QC unit types incorrectly, iff they were specified as integers.

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@78016 bc3139a8-67e5-0310-9ffc-ced21a209358

H Bengtsson authored on 29/06/2013 10:45:00
Showing 1 changed files

R/writeCdf.private.R

History View file @ 99b0778

@@ -158,23 +158,27 @@
                                      .writeCdfUnit <- function(unit, con, unitname=NULL) {
                                        ## 3. Write the unit
                                     -  unitDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                     -  unitDirection <- unitDirections[unit$unitdirection];
                                     -  unitType <- switch(unit$unittype,
                                     -                     unknown = 0,
                                     -                     expression = 1,
                                     -                     genotyping = 2,
                                     -                     resequencing = 3,
                                     -                     tag = 4,
                                     -                     copynumber = 5,
                                     -                     genotypingcontrol = 6,
                                     -                     expressioncontrol = 7)
+                                    -
                                     -  unitDirection <- switch(unit$unitdirection,
                                     -                          nodirection = 0,
                                     -                          sense = 1,
                                     -                          antisense = 2,
                                     -                          unknown = 3)
                                     +  unitType <- unit$unittype
                                     +  if (!is.numeric(unitType)) {
                                     +    unitType <- switch(unitType,
                                     +                       unknown = 0,
                                     +                       expression = 1,
                                     +                       genotyping = 2,
                                     +                       resequencing = 3,
                                     +                       tag = 4,
                                     +                       copynumber = 5,
                                     +                       genotypingcontrol = 6,
                                     +                       expressioncontrol = 7)
                                     +  }
+                                    +
                                     +  unitDirection <- unit$unitdirection
                                     +  if (!is.numeric(unitDirection)) {
                                     +    unitDirection <- switch(unitDirection,
                                     +                            nodirection = 0,
                                     +                            sense = 1,
                                     +                            antisense = 2,
                                     +                            unknown = 3)
                                     +  }
                                        unitInfo <- as.integer(c(unitType, unitDirection,
                                                                 unit$natoms, length(unit$groups),
@@ -244,26 +248,29 @@
                                      .writeCdfQcUnit <- function(qcunit, con) {
                                        ## 2. Actually write the qcunit
                                     -  type <- switch(qcunit$type,
                                     -                 unknown = 0,
                                     -                 checkerboardNegative = 1,
                                     -                 checkerboardPositive = 2,
                                     -                 hybeNegative = 3,
                                     -                 hybePositive = 4,
                                     -                 textFeaturesNegative = 5,
                                     -                 textFeaturesPositive = 6,
                                     -                 centralNegative = 7,
                                     -                 centralPositive = 8,
                                     -                 geneExpNegative = 9,
                                     -                 geneExpPositive = 10,
                                     -                 cycleFidelityNegative = 11,
                                     -                 cycleFidelityPositive = 12,
                                     -                 centralCrossNegative = 13,
                                     -                 centralCrossPositive = 14,
                                     -                 crossHybeNegative = 15,
                                     -                 crossHybePositive = 16,
                                     -                 SpatialNormNegative = 17,
                                     -                 SpatialNormPositive = 18)
                                     +  type <- qcunit$type;
                                     +  if (!is.numeric(type)) {
                                     +    type <- switch(type,
                                     +                   unknown = 0,
                                     +                   checkerboardNegative = 1,
                                     +                   checkerboardPositive = 2,
                                     +                   hybeNegative = 3,
                                     +                   hybePositive = 4,
                                     +                   textFeaturesNegative = 5,
                                     +                   textFeaturesPositive = 6,
                                     +                   centralNegative = 7,
                                     +                   centralPositive = 8,
                                     +                   geneExpNegative = 9,
                                     +                   geneExpPositive = 10,
                                     +                   cycleFidelityNegative = 11,
                                     +                   cycleFidelityPositive = 12,
                                     +                   centralCrossNegative = 13,
                                     +                   centralCrossPositive = 14,
                                     +                   crossHybeNegative = 15,
                                     +                   crossHybePositive = 16,
                                     +                   SpatialNormNegative = 17,
                                     +                   SpatialNormPositive = 18)
                                     +  }
                                        # Write 2 + 4 bytes
                                        nbrOfBytes <- 6;
@@ -286,6 +293,12 @@
                                      ############################################################################
                                      # HISTORY:
                                     +# 2013-06-29
                                     +# o BUG FIX: Since affxparser 1.30.2/1.31.2, .writeCdfUnit() encoded unit
                                     +#   types incorrectly, iff specified as integers.
                                     +# o BUG FIX: Likewise, .writeCdfUnit() has always encoded unit directions
                                     +#   incorrectly, iff specified as integers.  Moreover, .writeCdfQcUnit()
                                     +#   has always encoded unit types incorrectly, iff specified as integers.
                                      # 2013-05-25 /HB
                                      # o Removed all gc() in .initializeCdf().
                                      # 2013-01-07 /HB

Browse code

Version: 1.33.2 [2013-05-25] o SPEEDUP: Removed all remaining gc() calls. o SPEEDUP: Replaced all rm() calls with NULL assignments.

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@76889 bc3139a8-67e5-0310-9ffc-ced21a209358

H Bengtsson authored on 25/05/2013 22:34:42
Showing 1 changed files

R/writeCdf.private.R

History View file @ 95092fc

@@ -1,306 +1,293 @@
                                      .initializeCdf <- function(con, nRows = 1, nCols = 1,
                                     -                          nUnits = 1, nQcUnits = 0,
                                     -                          refSeq = "",
                                     -                          unitnames = rep("", nUnits),
                                     -                          qcUnitLengths = rep(0, nQcUnits),
                                     -                          unitLengths = rep(0, nUnits),
                                     -                          ...) {
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    # Validate arguments
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    if(length(qcUnitLengths) != nQcUnits) {
                                     -      stop("Number of elements in argument 'qcUnitLengths' does not match 'nQcUnits'");
                                     +                           nUnits = 1, nQcUnits = 0,
                                     +                           refSeq = "",
                                     +                           unitnames = rep("", nUnits),
                                     +                           qcUnitLengths = rep(0, nQcUnits),
                                     +                           unitLengths = rep(0, nUnits),
                                     +                           ...) {
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  # Validate arguments
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  if(length(qcUnitLengths) != nQcUnits) {
                                     +    stop("Number of elements in argument 'qcUnitLengths' does not match 'nQcUnits'");
                                     +  }
+                                    +
                                     +  if(length(unitLengths) != nUnits) {
                                     +    stop("Number of elements in argument 'unitLengths' does not match 'nUnits'");
                                     +  }
+                                    +
                                     +  if(length(refSeq) != 1) {
                                     +    stop("Argument 'refSeq' should be a single character.");
                                     +  }
+                                    +
                                     +  lrefSeq <- nchar(refSeq);
+                                    +
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  # CDF header
                                     +  #
                                     +  # 1 Magic number. Always set to 67.                           [integer]
                                     +  # 2 Version number.                                           [integer]
                                     +  # 3 The number of columns of cells on the array.       [unsigned short]
                                     +  # 4 The number of rows of cells on the array.          [unsigned short]
                                     +  # 5 The number of units in the array not including QC units. The term
                                     +  #   unit is an internal term which means probe set.           [integer]
                                     +  # 6 The number of QC units.                                   [integer]
                                     +  # 7 The length of the resequencing reference sequence.        [integer]
                                     +  # 8 The resequencing reference sequence.                    [char[len]]
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  offset <- 0;
+                                    +
                                     +  ## Magic number and version number
                                     +  writeBin(object = as.integer(c(67, 1)),
                                     +           con = con, size = 4, endian = "little")
                                     +  ## NCols, NRows
                                     +  writeBin(object = as.integer(c(nCols, nRows)),
                                     +           con = con, size = 2, endian = "little")
                                     +  ## NumberUnits, NumberQCUnits
                                     +  writeBin(object = as.integer(c(nUnits, nQcUnits)),
                                     +           con = con, size = 4, endian = "little")
                                     +  ## Length of refSeqsequence
                                     +  writeBin(object = as.integer(lrefSeq),
                                     +           con = con, size = 4, endian = "little")
                                     +  offset <- 24;
+                                    +
                                     +  fOffset <- seek(con=con, origin="start", rw="write");
                                     +  if (offset != fOffset) {
                                     +    stop("File format write error (step 1): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     +  }
+                                    +
                                     +  ## RefSeqsequece
                                     +  if(lrefSeq > 0)
                                     +    writeChar(as.character(refSeq), con=con, eos=NULL);
+                                    +
                                     +  # Current offset
                                     +  offset <- offset + lrefSeq;
+                                    +
                                     +  fOffset <- seek(con=con, origin="start", rw="write");
                                     +  if (offset != fOffset) {
                                     +    stop("File format write error (step 2): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     +  }
+                                    +
+                                    +
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  # Unit names
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  # Write to raw vector (2*10^6 units => 122Mb; should be ok for now)
                                     +  # Since we can't create strings with '\0':s, we use '\xFF',
                                     +  # write to raw and then replace '\xFF' with '\0'. Thus, unit names with
                                     +  # '\xFF' are invalid, but this should not be a real problem.
                                     +  pads <- sapply(0:64, FUN=function(x) paste(rep("\xFF", x), collapse=""));
+                                    +
                                     +  # Write the unit names in chunks to save memory
                                     +  nbrOfUnits <- length(unitnames);
                                     +  chunkSize <- 100000;
                                     +  nbrOfChunks <- ceiling(nbrOfUnits / chunkSize);
+                                    +
                                     +  # Allocate raw vector
                                     +  raw <- raw(64*chunkSize);
+                                    +
                                     +  for (kk in 1:nbrOfChunks) {
                                     +    # Units for this chunk
                                     +    from <- (kk-1)*chunkSize+1;
                                     +    to <- min(from+chunkSize-1, nbrOfUnits);
                                     +    unitnamesFF <- unitnames[from:to];
+                                    +
                                     +    # Pad the unit names
                                     +    unitnamesFF <- paste(unitnamesFF, pads[64-nchar(unitnamesFF)], sep="");
+                                    +
                                     +    # Truncate last chunk?
                                     +    if (chunkSize > length(unitnamesFF)) {
                                     +      raw <- raw[1:(64*length(unitnamesFF))];
+                                         }
                                     -    if(length(unitLengths) != nUnits) {
                                     -      stop("Number of elements in argument 'unitLengths' does not match 'nUnits'");
                                     -    }
+                                    -
                                     -    if(length(refSeq) != 1)
                                     -        stop("Argument 'refSeq' should be a single character.");
+                                    -
                                     -    lrefSeq <- nchar(refSeq);
+                                    -
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    # CDF header
                                     -    #
                                     -    # 1 Magic number. Always set to 67.                           [integer]
                                     -    # 2 Version number.                                           [integer]
                                     -    # 3 The number of columns of cells on the array.       [unsigned short]
                                     -    # 4 The number of rows of cells on the array.          [unsigned short]
                                     -    # 5 The number of units in the array not including QC units. The term
                                     -    #   unit is an internal term which means probe set.           [integer]
                                     -    # 6 The number of QC units.                                   [integer]
                                     -    # 7 The length of the resequencing reference sequence.        [integer]
                                     -    # 8 The resequencing reference sequence.                    [char[len]]
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    offset <- 0;
+                                    -
                                     -    ## Magic number and version number
                                     -    writeBin(object = as.integer(c(67, 1)),
                                     -             con = con, size = 4, endian = "little")
                                     -    ## NCols, NRows
                                     -    writeBin(object = as.integer(c(nCols, nRows)),
                                     -             con = con, size = 2, endian = "little")
                                     -    ## NumberUnits, NumberQCUnits
                                     -    writeBin(object = as.integer(c(nUnits, nQcUnits)),
                                     -             con = con, size = 4, endian = "little")
                                     -    ## Length of refSeqsequence
                                     -    writeBin(object = as.integer(lrefSeq),
                                     -             con = con, size = 4, endian = "little")
                                     -    offset <- 24;
+                                    -
                                     -    fOffset <- seek(con=con, origin="start", rw="write");
                                     -    if (offset != fOffset) {
                                     -      stop("File format write error (step 1): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     -    }
+                                    -
                                     -    ## RefSeqsequece
                                     -    if(lrefSeq > 0)
                                     -      writeChar(as.character(refSeq), con=con, eos=NULL);
+                                    -
                                     -    # Current offset
                                     -    offset <- offset + lrefSeq;
+                                    -
                                     -    fOffset <- seek(con=con, origin="start", rw="write");
                                     -    if (offset != fOffset) {
                                     -      stop("File format write error (step 2): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     -    }
+                                    -
+                                    -
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    # Unit names
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    # Write to raw vector (2*10^6 units => 122Mb; should be ok for now)
                                     -    # Since we can't create strings with '\0':s, we use '\xFF',
                                     -    # write to raw and then replace '\xFF' with '\0'. Thus, unit names with
                                     -    # '\xFF' are invalid, but this should not be a real problem.
                                     -    pads <- sapply(0:64, FUN=function(x) paste(rep("\xFF", x), collapse=""));
+                                    -
                                     -    # Write the unit names in chunks to save memory
                                     -    nbrOfUnits <- length(unitnames);
                                     -    chunkSize <- 100000;
                                     -    nbrOfChunks <- ceiling(nbrOfUnits / chunkSize);
+                                    -
                                     -    # Allocate raw vector
                                     -    raw <- raw(64*chunkSize);
+                                    -
                                     -    for (kk in 1:nbrOfChunks) {
                                     -      # Units for this chunk
                                     -      from <- (kk-1)*chunkSize+1;
                                     -      to <- min(from+chunkSize-1, nbrOfUnits);
                                     -      unitnamesFF <- unitnames[from:to];
+                                    -
                                     -      # Pad the unit names
                                     -      unitnamesFF <- paste(unitnamesFF, pads[64-nchar(unitnamesFF)], sep="");
+                                    -
                                     -      # Truncate last chunk?
                                     -      if (chunkSize > length(unitnamesFF)) {
                                     -        raw <- raw[1:(64*length(unitnamesFF))];
                                     -      }
+                                    -
                                     -      # Write unit names to raw vector
                                     -      raw <- writeBin(con=raw, unitnamesFF, size=1);
+                                    -
                                     -      rm(unitnamesFF);
+                                    -
                                     -      # Garbage collect
                                     -#      gc <- gc();
                                     -#      print(gc);
+                                    -
                                     -      # Replace all '\xFF' with '\0'.
                                     -      idxs <- which(raw == as.raw(255));
                                     -      raw[idxs] <- as.raw(0);
                                     -      rm(idxs);
+                                    -
                                     -      writeBin(con=con, raw);
                                     -   } # for (kk in ...)
+                                    -
                                     -   rm(raw);
                                     -   # Garbage collect
                                     -   gc <- gc();
+                                    -
                                     -#    writeChar(con=con, as.character(unitnames), nchars=rep(64, nUnits), eos=NULL)
+                                    -
                                     -    bytesOfUnitNames <- 64 * nUnits;
                                     -    offset <- offset + bytesOfUnitNames;
+                                    -
                                     -    fOffset <- seek(con=con, origin="start", rw="write");
                                     -    if (offset != fOffset) {
                                     -      stop("File format write error (step 3): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     -    }
+                                    -
                                     -    bytesOfQcUnits <- 4 * nQcUnits;
                                     -    offset <- offset + bytesOfQcUnits;
+                                    -
                                     -    bytesOfUnits <- 4 * nUnits;
                                     -    offset <- offset + bytesOfUnits;
+                                    -
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    # QC units file positions
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    if (nQcUnits > 0) {
                                     -      csum <- cumsum(qcUnitLengths);
                                     -      nextOffset <- csum[nQcUnits];
                                     -      starts <- c(0, csum[-nQcUnits]);
                                     -      starts <- as.integer(offset + starts);
                                     -      writeBin(starts, con = con, size = 4, endian = "little")
                                     -    } else {
                                     -      nextOffset <- 0;
                                     -#      starts <- 0;
                                     -#      starts <- as.integer(offset + starts);
                                     -#      writeBin(starts, con = con, size = 4, endian = "little")
                                     -    }
+                                    -
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    # Units file positions
                                     -    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    offset <- offset + nextOffset;
                                     -    if (nUnits > 0) {
                                     -      csum <- cumsum(unitLengths);
                                     -      nextOffset <- csum[nUnits];
                                     -      starts <- c(0, csum[-nUnits]);
                                     -      starts <- as.integer(offset + starts);
                                     -      writeBin(starts, con = con, size = 4, endian = "little");
                                     -    } else {
                                     -      nextOffset <- 0;
                                     -    }
                                     +    # Write unit names to raw vector
                                     +    raw <- writeBin(con=raw, unitnamesFF, size=1);
                                     +    unitnamesFF <- NULL; # Not needed anymore
+                                    +
                                     +    # Replace all '\xFF' with '\0'.
                                     +    idxs <- which(raw == as.raw(255));
                                     +    raw[idxs] <- as.raw(0);
                                     +    idxs <- NULL; # Not needed anymore
+                                    +
                                     +    writeBin(con=con, raw);
                                     +  } # for (kk in ...)
                                     +  raw <- NULL; # Not needed anymore
+                                    +
                                     +  bytesOfUnitNames <- 64 * nUnits;
                                     +  offset <- offset + bytesOfUnitNames;
+                                    +
                                     +  fOffset <- seek(con=con, origin="start", rw="write");
                                     +  if (offset != fOffset) {
                                     +    stop("File format write error (step 3): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     +  }
+                                    +
                                     +  bytesOfQcUnits <- 4 * nQcUnits;
                                     +  offset <- offset + bytesOfQcUnits;
+                                    +
                                     +  bytesOfUnits <- 4 * nUnits;
                                     +  offset <- offset + bytesOfUnits;
+                                    +
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  # QC units file positions
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  if (nQcUnits > 0) {
                                     +    csum <- cumsum(qcUnitLengths);
                                     +    nextOffset <- csum[nQcUnits];
                                     +    starts <- c(0, csum[-nQcUnits]);
                                     +    starts <- as.integer(offset + starts);
                                     +    writeBin(starts, con = con, size = 4, endian = "little")
                                     +  } else {
                                     +    nextOffset <- 0;
                                     +  }
+                                    +
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  # Units file positions
                                     +  # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +  offset <- offset + nextOffset;
                                     +  if (nUnits > 0) {
                                     +    csum <- cumsum(unitLengths);
                                     +    nextOffset <- csum[nUnits];
                                     +    starts <- c(0, csum[-nUnits]);
                                     +    starts <- as.integer(offset + starts);
                                     +    writeBin(starts, con = con, size = 4, endian = "little");
                                     +  } else {
                                     +    nextOffset <- 0;
                                     +  }
                                      } # .initializeCdf()
                                      .writeCdfUnit <- function(unit, con, unitname=NULL) {
                                     -    ## 3. Write the unit
                                     -    unitDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                     -    unitDirection <- unitDirections[unit$unitdirection];
                                     -    unitType <- switch(unit$unittype,
                                     -                       unknown = 0,
                                     -                       expression = 1,
                                     -                       genotyping = 2,
                                     -                       resequencing = 3,
                                     -                       tag = 4,
                                     -                       copynumber = 5,
                                     -                       genotypingcontrol = 6,
                                     -                       expressioncontrol = 7)
+                                    -
                                     -    unitDirection <- switch(unit$unitdirection,
                                     -                            nodirection = 0,
                                     -                            sense = 1,
                                     -                            antisense = 2,
                                     -                            unknown = 3)
+                                    -
                                     -    unitInfo <- as.integer(c(unitType, unitDirection,
                                     -                             unit$natoms, length(unit$groups),
                                     -                             unit$ncells, unit$unitnumber,
                                     -                             unit$ncellsperatom))
+                                    -
                                     -    # Number of bytes: 2+1+4*4+1=20 bytes
                                     -    writeBin(unitInfo[1],
                                     -             con = con, size = 2, endian = "little")
                                     -    writeBin(unitInfo[2],
                                     -             con = con, size = 1, endian = "little")
                                     -    writeBin(unitInfo[3:6],
                                     +  ## 3. Write the unit
                                     +  unitDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                     +  unitDirection <- unitDirections[unit$unitdirection];
                                     +  unitType <- switch(unit$unittype,
                                     +                     unknown = 0,
                                     +                     expression = 1,
                                     +                     genotyping = 2,
                                     +                     resequencing = 3,
                                     +                     tag = 4,
                                     +                     copynumber = 5,
                                     +                     genotypingcontrol = 6,
                                     +                     expressioncontrol = 7)
+                                    +
                                     +  unitDirection <- switch(unit$unitdirection,
                                     +                          nodirection = 0,
                                     +                          sense = 1,
                                     +                          antisense = 2,
                                     +                          unknown = 3)
+                                    +
                                     +  unitInfo <- as.integer(c(unitType, unitDirection,
                                     +                           unit$natoms, length(unit$groups),
                                     +                           unit$ncells, unit$unitnumber,
                                     +                           unit$ncellsperatom))
+                                    +
                                     +  # Number of bytes: 2+1+4*4+1=20 bytes
                                     +  writeBin(unitInfo[1],
                                     +           con = con, size = 2, endian = "little")
                                     +  writeBin(unitInfo[2],
                                     +           con = con, size = 1, endian = "little")
                                     +  writeBin(unitInfo[3:6],
                                     +           con = con, size = 4, endian = "little")
                                     +  writeBin(unitInfo[7],
                                     +           con = con, size = 1, endian = "little")
+                                    +
                                     +  ## Writing each group in turn
                                     +  # Number of bytes: (18+64)*nbrOfGroups + 14*totalNbrOfCells bytes
                                     +  groupDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                     +  for(igroup in seq(along.with = unit$groups)) {
                                     +    group <- unit$groups[[igroup]]
                                     +    groupDirection <- groupDirections[group$groupdirection];
                                     +    groupDirection <- switch(group$groupdirection,
                                     +                             nodirection = 0,
                                     +                             sense = 1,
                                     +                             antisense = 2,
                                     +                             unknown = 3)
                                     +    groupInfo <- as.integer(c(group$natoms, length(group$x),
                                     +                              group$ncellsperatom,
                                     +                              groupDirection, min(group$atoms, 0)))
                                     +    # Number of bytes: 2*4+2*1+2*4=18 bytes
                                     +    writeBin(groupInfo[1:2],
                                                   con = con, size = 4, endian = "little")
                                     -    writeBin(unitInfo[7],
                                     +    writeBin(groupInfo[3:4],
                                                   con = con, size = 1, endian = "little")
                                     +    writeBin(groupInfo[5:6],
                                     +             con = con, size = 4, endian = "little")
                                     -    ## Writing each group in turn
                                     -    # Number of bytes: (18+64)*nbrOfGroups + 14*totalNbrOfCells bytes
                                     -    groupDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                     -    for(igroup in seq(along.with = unit$groups)) {
                                     -        group <- unit$groups[[igroup]]
                                     -        groupDirection <- groupDirections[group$groupdirection];
                                     -        groupDirection <- switch(group$groupdirection,
                                     -                                 nodirection = 0,
                                     -                                 sense = 1,
                                     -                                 antisense = 2,
                                     -                                 unknown = 3)
                                     -        groupInfo <- as.integer(c(group$natoms, length(group$x),
                                     -                                  group$ncellsperatom,
                                     -                                  groupDirection, min(group$atoms, 0)))
                                     -       # Number of bytes: 2*4+2*1+2*4=18 bytes
                                     -        writeBin(groupInfo[1:2],
                                     -                 con = con, size = 4, endian = "little")
                                     -        writeBin(groupInfo[3:4],
                                     -                 con = con, size = 1, endian = "little")
                                     -        writeBin(groupInfo[5:6],
                                     -                 con = con, size = 4, endian = "little")
+                                    -
                                     -        # Number of bytes: 64 bytes
                                     -        suppressWarnings({
                                     -          writeChar(as.character(names(unit$groups)[igroup]),
                                     -                    con = con, nchars = 64, eos = NULL)
                                     -        })
+                                    -
                                     -        ## Writing each cell in turn
                                     -#        cells <- matrix(as.integer(c(group$atom, group$x,
                                     -#                                     group$y, group$indexpos)),
                                     -#                        ncol = 4)
                                     -        cells <- matrix(as.integer(c(group$indexpos, group$x,
                                     -                                     group$y, group$atom)),
                                     -                        ncol = 4)
+                                    -
                                     -        # Number of bytes: 14*nbrOfCells bytes
                                     -        for(icell in seq(along.with = group$x)) {
                                     -            # Number of bytes: 1*4+2*2+1*4+1*2=14 bytes
                                     -            writeBin(cells[icell, 1],
                                     -                     con = con, size = 4, endian = "little")
                                     -            writeBin(cells[icell, 2:3],
                                     -                     con = con, size = 2, endian = "little")
                                     -            writeBin(cells[icell, 4],
                                     -                     con = con, size = 4, endian = "little")
                                     -            writeChar(as.character(c(group$pbase[icell],
                                     -                                     group$tbase[icell])),
                                     -                      con = con, nchars = c(1,1), eos = NULL)
                                     -        }
                                     -    }
                                     +    # Number of bytes: 64 bytes
                                     +    suppressWarnings({
                                     +      writeChar(as.character(names(unit$groups)[igroup]),
                                     +                con = con, nchars = 64, eos = NULL)
                                     +    })
+                                    +
                                     +    ## Writing each cell in turn
                                     +    cells <- matrix(as.integer(c(group$indexpos, group$x,
                                     +                                 group$y, group$atom)),
                                     +                    ncol = 4)
+                                    +
                                     +    # Number of bytes: 14*nbrOfCells bytes
                                     +    for(icell in seq(along.with = group$x)) {
                                     +      # Number of bytes: 1*4+2*2+1*4+1*2=14 bytes
                                     +      writeBin(cells[icell, 1],
                                     +               con = con, size = 4, endian = "little")
                                     +      writeBin(cells[icell, 2:3],
                                     +               con = con, size = 2, endian = "little")
                                     +      writeBin(cells[icell, 4],
                                     +               con = con, size = 4, endian = "little")
                                     +      writeChar(as.character(c(group$pbase[icell],
                                     +                               group$tbase[icell])),
                                     +                con = con, nchars = c(1,1), eos = NULL)
                                     +    } # for (icell ...)
                                     +  } # for (igroup ...)
                                      } # .writeCdfUnit()
                                      .writeCdfQcUnit <- function(qcunit, con) {
                                     -    ## 2. Actually write the qcunit
                                     -    type <- switch(qcunit$type,
                                     -                   unknown = 0,
                                     -                   checkerboardNegative = 1,
                                     -                   checkerboardPositive = 2,
                                     -                   hybeNegative = 3,
                                     -                   hybePositive = 4,
                                     -                   textFeaturesNegative = 5,
                                     -                   textFeaturesPositive = 6,
                                     -                   centralNegative = 7,
                                     -                   centralPositive = 8,
                                     -                   geneExpNegative = 9,
                                     -                   geneExpPositive = 10,
                                     -                   cycleFidelityNegative = 11,
                                     -                   cycleFidelityPositive = 12,
                                     -                   centralCrossNegative = 13,
                                     -                   centralCrossPositive = 14,
                                     -                   crossHybeNegative = 15,
                                     -                   crossHybePositive = 16,
                                     -                   SpatialNormNegative = 17,
                                     -                   SpatialNormPositive = 18)
+                                    -
                                     -    # Write 2 + 4 bytes
                                     -    nbrOfBytes <- 6;
                                     -    qcunitInfo <- as.integer(c(type, qcunit$ncells))
                                     -    writeBin(qcunitInfo[1], con = con, size = 2, endian = "little")
                                     -    writeBin(qcunitInfo[2], con = con, size = 4, endian = "little")
+                                    -
                                     -    # Write 2 + 4 bytes
                                     -    nCells <- length(qcunit$x);
                                     -    nbrOfBytes <- 7*nCells;
                                     -    cells <- matrix(as.integer(c(qcunit$x, qcunit$y, qcunit$length,
                                     -                                 qcunit$pm, qcunit$background)),
                                     -                    ncol = 5)
                                     -    for(icell in seq(along.with = qcunit$x)) {
                                     -        writeBin(cells[icell, 1:2], con = con, size = 2, endian = "little")
                                     -        writeBin(cells[icell, 3:5], con = con, size = 1, endian = "little")
                                     -    }
                                     +  ## 2. Actually write the qcunit
                                     +  type <- switch(qcunit$type,
                                     +                 unknown = 0,
                                     +                 checkerboardNegative = 1,
                                     +                 checkerboardPositive = 2,
                                     +                 hybeNegative = 3,
                                     +                 hybePositive = 4,
                                     +                 textFeaturesNegative = 5,
                                     +                 textFeaturesPositive = 6,
                                     +                 centralNegative = 7,
                                     +                 centralPositive = 8,
                                     +                 geneExpNegative = 9,
                                     +                 geneExpPositive = 10,
                                     +                 cycleFidelityNegative = 11,
                                     +                 cycleFidelityPositive = 12,
                                     +                 centralCrossNegative = 13,
                                     +                 centralCrossPositive = 14,
                                     +                 crossHybeNegative = 15,
                                     +                 crossHybePositive = 16,
                                     +                 SpatialNormNegative = 17,
                                     +                 SpatialNormPositive = 18)
+                                    +
                                     +  # Write 2 + 4 bytes
                                     +  nbrOfBytes <- 6;
                                     +  qcunitInfo <- as.integer(c(type, qcunit$ncells))
                                     +  writeBin(qcunitInfo[1], con = con, size = 2, endian = "little")
                                     +  writeBin(qcunitInfo[2], con = con, size = 4, endian = "little")
+                                    +
                                     +  # Write 2 + 4 bytes
                                     +  nCells <- length(qcunit$x);
                                     +  nbrOfBytes <- 7*nCells;
                                     +  cells <- matrix(as.integer(c(qcunit$x, qcunit$y, qcunit$length,
                                     +                               qcunit$pm, qcunit$background)),
                                     +                  ncol = 5)
                                     +  for(icell in seq(along.with = qcunit$x)) {
                                     +    writeBin(cells[icell, 1:2], con = con, size = 2, endian = "little")
                                     +    writeBin(cells[icell, 3:5], con = con, size = 1, endian = "little")
                                     +  }
                                      } # .writeCdfQcUnit()
                                      ############################################################################
                                      # HISTORY:
                                     +# 2013-05-25 /HB
                                     +# o Removed all gc() in .initializeCdf().
                                      # 2013-01-07 /HB
                                      # o GENERALIZATION: .writeCdfUnit() now also encodes unit types
                                      #   'genotypingcontrol' and 'expressioncontrol'.

Browse code

Version: 1.31.2 [2013-01-07] o BUG FIX: writeCdf() did not encode unit types as decoded by readCdf(). Unit type 'unknown' was incorrectly encoded such that readCdf() would decode it as 'copynumber'. Also, unit types 'genotypingcontrol' and 'expressioncontrol' where not encoded at all.

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@72352 bc3139a8-67e5-0310-9ffc-ced21a209358

H Bengtsson authored on 08/01/2013 02:58:16
Showing 1 changed files

R/writeCdf.private.R

History View file @ 006d815

@@ -170,30 +170,17 @@
                                      .writeCdfUnit <- function(unit, con, unitname=NULL) {
                                          ## 3. Write the unit
                                     -##    unitTypes <- c(expression=1, genotyping=2, tag=3,
                                     -##                                             resequencing=4, unknown=5);
                                     -##
                                     -##    unitType <- unitTypes[unit$unittype];
                                          unitDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                          unitDirection <- unitDirections[unit$unitdirection];
+                                    -
                                     -##    unitType <- switch(unit$unittype,
                                     -##                       expression = 1,
                                     -##                       genotyping = 2,
                                     -##                       tag = 3,
                                     -##                       resequencing = 4,
                                     -##                       unknown = 5)
+                                    -
                                     -    # In some version of the Fusion SDK documentation, the unit type
                                     -    # with value 5 (five) was labelled "unknown".  For backward
                                     -    # compatibility we recognize input value "unknown" as well.
                                          unitType <- switch(unit$unittype,
                                     +                       unknown = 0,
                                                             expression = 1,
                                                             genotyping = 2,
                                                             resequencing = 3,
                                                             tag = 4,
                                                             copynumber = 5,
                                     -                       unknown = 5)
                                     +                       genotypingcontrol = 6,
                                     +                       expressioncontrol = 7)
                                          unitDirection <- switch(unit$unitdirection,
                                                                  nodirection = 0,
@@ -314,6 +301,11 @@
                                      ############################################################################
                                      # HISTORY:
                                     +# 2013-01-07 /HB
                                     +# o GENERALIZATION: .writeCdfUnit() now also encodes unit types
                                     +#   'genotypingcontrol' and 'expressioncontrol'.
                                     +# o BUG FIX: .writeCdfUnit() incorrectly encoded the 'unknown' unit type
                                     +#   as 5 and not 0.
                                      # 2008-08-09 /HB
                                      # o BUG FIX: .writeCdfUnit() did output unit type 'resequencing' and 'tag'
                                      #   as 4 and 3, and not 3 and 4, respectively.

Browse code

Version: 1.29.1 [2012-05-18] o Replaced several throw() with stop(), because the former assumes that R.methodsS3 is loaded, which it may not be. o ROBUSTNESS: Added a system test forvalidating that the package can write and read a CDF. The test is spawning of another R process so that the test is robust against core dumps.

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@66038 bc3139a8-67e5-0310-9ffc-ced21a209358

Henrik Bengtsson authored on 19/05/2012 06:18:26
Showing 1 changed files

R/writeCdf.private.R

History View file @ b5de40e

@@ -52,7 +52,7 @@
                                          fOffset <- seek(con=con, origin="start", rw="write");
                                          if (offset != fOffset) {
                                     -      throw("File format write error (step 1): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     +      stop("File format write error (step 1): File offset is not the excepted one: ", fOffset, " != ", offset);
+                                         }
                                          ## RefSeqsequece
@@ -64,7 +64,7 @@
                                          fOffset <- seek(con=con, origin="start", rw="write");
                                          if (offset != fOffset) {
                                     -      throw("File format write error (step 2): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     +      stop("File format write error (step 2): File offset is not the excepted one: ", fOffset, " != ", offset);
+                                         }
@@ -127,7 +127,7 @@
                                          fOffset <- seek(con=con, origin="start", rw="write");
                                          if (offset != fOffset) {
                                     -      throw("File format write error (step 3): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     +      stop("File format write error (step 3): File offset is not the excepted one: ", fOffset, " != ", offset);
+                                         }
                                          bytesOfQcUnits <- 4 * nQcUnits;

Browse code

## Will wait to bump/rebuild this until further validated: ## ## Version: 1.13.5 [2008-08-09] ## o BUG FIX: writeCdf() would write 'CustomSeq' units ## as 'Tag' units, and vice versa. This means that ## ASCII CDFs containing such units and converted with ## convertCdf() would be have an incorrect unit type for ## these units. Also, unit type 'Copy Number' is ## reported as "copynumber" and no longer as "unknown". ## o BUG FIX: The increase of the internal buffer for ## reading the 'refseq' header field of ASCII CDFs that ## was done in v1.11.2 was mistakenly undone in v1.13.3. ## o Made readCdf() recognize more unit types.

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@33157 bc3139a8-67e5-0310-9ffc-ced21a209358

Henrik Bengtsson authored on 10/08/2008 09:06:09
Showing 1 changed files

R/writeCdf.private.R

History View file @ efdb07d

@@ -170,18 +170,31 @@
                                      .writeCdfUnit <- function(unit, con, unitname=NULL) {
                                          ## 3. Write the unit
                                     -    unitTypes <- c(expression=1, genotyping=2, tag=3,
                                     -                                             resequencing=4, unknown=5);
                                     -    unitType <- unitTypes[unit$unittype];
                                     +##    unitTypes <- c(expression=1, genotyping=2, tag=3,
                                     +##                                             resequencing=4, unknown=5);
                                     +##
                                     +##    unitType <- unitTypes[unit$unittype];
                                          unitDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                          unitDirection <- unitDirections[unit$unitdirection];
                                     +##    unitType <- switch(unit$unittype,
                                     +##                       expression = 1,
                                     +##                       genotyping = 2,
                                     +##                       tag = 3,
                                     +##                       resequencing = 4,
                                     +##                       unknown = 5)
+                                    +
                                     +    # In some version of the Fusion SDK documentation, the unit type
                                     +    # with value 5 (five) was labelled "unknown".  For backward
                                     +    # compatibility we recognize input value "unknown" as well.
                                          unitType <- switch(unit$unittype,
                                                             expression = 1,
                                                             genotyping = 2,
                                     -                       tag = 3,
                                     -                       resequencing = 4,
                                     +                       resequencing = 3,
                                     +                       tag = 4,
                                     +                       copynumber = 5,
                                                             unknown = 5)
+                                    +
                                          unitDirection <- switch(unit$unitdirection,
                                                                  nodirection = 0,
                                                                  sense = 1,
@@ -301,8 +314,11 @@
                                      ############################################################################
                                      # HISTORY:
                                     +# 2008-08-09 /HB
                                     +# o BUG FIX: .writeCdfUnit() did output unit type 'resequencing' and 'tag'
                                     +#   as 4 and 3, and not 3 and 4, respectively.
                                      # 2007-11-13 /KH
                                     -# o BUG FIX: The rrror message in internal .initializeCdf() would mention
                                     +# o BUG FIX: The error message in internal .initializeCdf() would mention
                                      #   'qcUnitLengths' when it was meant to say 'unitLengths'.
                                      # 2007-07-13 /HB
                                      # o While writing unit names in .initializeCdf(), quite a few copies were

Browse code

# Added HISTORY comment to writeCdf.private.R. # Corrected JHB comment CDFFileData.cpp. # Updated inst/HISTORY file with correct updates. # Migrated all BUG FIXES/TYPO FIXES to the release version as well.

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@29108 bc3139a8-67e5-0310-9ffc-ced21a209358

Henrik Bengtsson authored on 08/12/2007 19:20:14
Showing 1 changed files

R/writeCdf.private.R

History View file @ bab64c3

@@ -301,6 +301,9 @@
                                      ############################################################################
                                      # HISTORY:
                                     +# 2007-11-13 /KH
                                     +# o BUG FIX: The rrror message in internal .initializeCdf() would mention
                                     +#   'qcUnitLengths' when it was meant to say 'unitLengths'.
                                      # 2007-07-13 /HB
                                      # o While writing unit names in .initializeCdf(), quite a few copies were
                                      #   created using up a lot of memory.  By removing unused objects and

Browse code

Bugfix in .initializeCdf thanks to bug report from E. Purdum

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@28691 bc3139a8-67e5-0310-9ffc-ced21a209358

Kasper D. Hansen authored on 13/11/2007 23:42:35
Showing 1 changed files

R/writeCdf.private.R

History View file @ 0224bd3

@@ -13,7 +13,7 @@
+                                         }
                                          if(length(unitLengths) != nUnits) {
                                     -      stop("Number of elements in argument 'qcUnitLengths' does not match 'nUnits'");
                                     +      stop("Number of elements in argument 'unitLengths' does not match 'nUnits'");
+                                         }
                                          if(length(refSeq) != 1)
@@ -28,7 +28,7 @@
                                          # 2 Version number.                                           [integer]
                                          # 3 The number of columns of cells on the array.       [unsigned short]
                                          # 4 The number of rows of cells on the array.          [unsigned short]
                                     -    # 5 The number of units in the array not including QC units. The term
                                     +    # 5 The number of units in the array not including QC units. The term
                                          #   unit is an internal term which means probe set.           [integer]
                                          # 6 The number of QC units.                                   [integer]
                                          # 7 The length of the resequencing reference sequence.        [integer]
@@ -53,8 +53,8 @@
                                          fOffset <- seek(con=con, origin="start", rw="write");
                                          if (offset != fOffset) {
                                            throw("File format write error (step 1): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     -    }
+                                    -
                                     +    }
+                                    +
                                          ## RefSeqsequece
                                          if(lrefSeq > 0)
                                            writeChar(as.character(refSeq), con=con, eos=NULL);
@@ -65,7 +65,7 @@
                                          fOffset <- seek(con=con, origin="start", rw="write");
                                          if (offset != fOffset) {
                                            throw("File format write error (step 2): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     -    }
                                     +    }
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
@@ -128,7 +128,7 @@
                                          fOffset <- seek(con=con, origin="start", rw="write");
                                          if (offset != fOffset) {
                                            throw("File format write error (step 3): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     -    }
                                     +    }
                                          bytesOfQcUnits <- 4 * nQcUnits;
                                          offset <- offset + bytesOfQcUnits;
@@ -170,7 +170,7 @@
                                      .writeCdfUnit <- function(unit, con, unitname=NULL) {
                                          ## 3. Write the unit
                                     -    unitTypes <- c(expression=1, genotyping=2, tag=3,
                                     +    unitTypes <- c(expression=1, genotyping=2, tag=3,
                                                                                   resequencing=4, unknown=5);
                                          unitType <- unitTypes[unit$unittype];
                                          unitDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
@@ -193,7 +193,7 @@
                                                                   unit$ncells, unit$unitnumber,
                                                                   unit$ncellsperatom))
                                     -    # Number of bytes: 2+1+4*4+1=20 bytes
                                     +    # Number of bytes: 2+1+4*4+1=20 bytes
                                          writeBin(unitInfo[1],
                                                   con = con, size = 2, endian = "little")
                                          writeBin(unitInfo[2],
@@ -228,7 +228,7 @@
                                              # Number of bytes: 64 bytes
                                              suppressWarnings({
                                                writeChar(as.character(names(unit$groups)[igroup]),
                                     -                    con = con, nchars = 64, eos = NULL)
                                     +                    con = con, nchars = 64, eos = NULL)
                                              })
                                              ## Writing each cell in turn
@@ -302,7 +302,7 @@
                                      ############################################################################
                                      # HISTORY:
                                      # 2007-07-13 /HB
                                     -# o While writing unit names in .initializeCdf(), quite a few copies were
                                     +# o While writing unit names in .initializeCdf(), quite a few copies were
                                      #   created using up a lot of memory.  By removing unused objects and
                                      #   writing unit names in chunks memory usage is now stable and < 200MB.
                                      # 2007-02-01 /HB
@@ -314,7 +314,7 @@
                                      # o Added writeCdfHeader(), writeCdfQcUnits() and writeCdfUnits().  With
                                      #   these it is now possible to build up the CDF in chunks.
                                      # o Removed obsolete arguments 'addName' and 'addPositions' and all related
                                     -#   code.  Internal variable 'positions' is not needed anymore.
                                     +#   code.  Internal variable 'positions' is not needed anymore.
                                      #   There are no more seek():s in the code.
                                      # o Removed obsolete .writeCdfUnit2().
                                      # o Now only every 1000th unit (instead of 100th) is reported. It is now

Browse code

Version: 1.9.2 [2007-07-27] o Optimized writeCdfHeader() for memory. For a CDF with 1,200,000+ units just writing the unit names would consume 1-1.5GB RAM. Now it writes unit names in chunks keeping the memory overhead around 100-200MB. o Made convertCdf() more memory efficient. o BUG FIX: The error message in isCelFile() when the file was not found was broken. o Updated to v1.9.2 on BioC devel.

Version: 1.8.1 [2007-07-26]
o Now affxparser install on OSX with PPC.

Version: 1.7.6 [2007-03-28] (never committed until v1.9.2)
o Modified findCdf() such that it is possible to set an alternative
function for how CDFs are located.

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@26016 bc3139a8-67e5-0310-9ffc-ced21a209358

Henrik Bengtsson authored on 28/07/2007 10:06:17
Showing 1 changed files

R/writeCdf.private.R

History View file @ f06bec1

@@ -34,6 +34,8 @@
                                          # 7 The length of the resequencing reference sequence.        [integer]
                                          # 8 The resequencing reference sequence.                    [char[len]]
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    offset <- 0;
+                                    +
                                          ## Magic number and version number
                                          writeBin(object = as.integer(c(67, 1)),
                                                   con = con, size = 4, endian = "little")
@@ -46,12 +48,25 @@
                                          ## Length of refSeqsequence
                                          writeBin(object = as.integer(lrefSeq),
                                                   con = con, size = 4, endian = "little")
                                     +    offset <- 24;
+                                    +
                                     +    fOffset <- seek(con=con, origin="start", rw="write");
                                     +    if (offset != fOffset) {
                                     +      throw("File format write error (step 1): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     +    }
+                                    +
                                          ## RefSeqsequece
                                          if(lrefSeq > 0)
                                            writeChar(as.character(refSeq), con=con, eos=NULL);
                                          # Current offset
                                     -    offset <- 24 + lrefSeq;
                                     +    offset <- offset + lrefSeq;
+                                    +
                                     +    fOffset <- seek(con=con, origin="start", rw="write");
                                     +    if (offset != fOffset) {
                                     +      throw("File format write error (step 2): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     +    }
+                                    +
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                          # Unit names
@@ -61,17 +76,60 @@
                                          # write to raw and then replace '\xFF' with '\0'. Thus, unit names with
                                          # '\xFF' are invalid, but this should not be a real problem.
                                          pads <- sapply(0:64, FUN=function(x) paste(rep("\xFF", x), collapse=""));
                                     -    unitnames <- paste(unitnames, pads[64-nchar(unitnames)], sep="");
                                     -    raw <- raw(64*length(unitnames));
                                     -    raw <- writeBin(con=raw, unitnames, size=1);
                                     -    raw[raw == as.raw(255)] <- as.raw(0);
                                     -    writeBin(con=con, raw);
                                     -    rm(raw);
+                                    +
                                     +    # Write the unit names in chunks to save memory
                                     +    nbrOfUnits <- length(unitnames);
                                     +    chunkSize <- 100000;
                                     +    nbrOfChunks <- ceiling(nbrOfUnits / chunkSize);
+                                    +
                                     +    # Allocate raw vector
                                     +    raw <- raw(64*chunkSize);
+                                    +
                                     +    for (kk in 1:nbrOfChunks) {
                                     +      # Units for this chunk
                                     +      from <- (kk-1)*chunkSize+1;
                                     +      to <- min(from+chunkSize-1, nbrOfUnits);
                                     +      unitnamesFF <- unitnames[from:to];
+                                    +
                                     +      # Pad the unit names
                                     +      unitnamesFF <- paste(unitnamesFF, pads[64-nchar(unitnamesFF)], sep="");
+                                    +
                                     +      # Truncate last chunk?
                                     +      if (chunkSize > length(unitnamesFF)) {
                                     +        raw <- raw[1:(64*length(unitnamesFF))];
                                     +      }
+                                    +
                                     +      # Write unit names to raw vector
                                     +      raw <- writeBin(con=raw, unitnamesFF, size=1);
+                                    +
                                     +      rm(unitnamesFF);
+                                    +
                                     +      # Garbage collect
                                     +#      gc <- gc();
                                     +#      print(gc);
+                                    +
                                     +      # Replace all '\xFF' with '\0'.
                                     +      idxs <- which(raw == as.raw(255));
                                     +      raw[idxs] <- as.raw(0);
                                     +      rm(idxs);
+                                    +
                                     +      writeBin(con=con, raw);
                                     +   } # for (kk in ...)
+                                    +
                                     +   rm(raw);
                                     +   # Garbage collect
                                     +   gc <- gc();
+                                    +
                                      #    writeChar(con=con, as.character(unitnames), nchars=rep(64, nUnits), eos=NULL)
                                          bytesOfUnitNames <- 64 * nUnits;
                                          offset <- offset + bytesOfUnitNames;
                                     +    fOffset <- seek(con=con, origin="start", rw="write");
                                     +    if (offset != fOffset) {
                                     +      throw("File format write error (step 3): File offset is not the excepted one: ", fOffset, " != ", offset);
                                     +    }
+                                    +
                                          bytesOfQcUnits <- 4 * nQcUnits;
                                          offset <- offset + bytesOfQcUnits;
@@ -243,6 +301,10 @@
                                      ############################################################################
                                      # HISTORY:
                                     +# 2007-07-13 /HB
                                     +# o While writing unit names in .initializeCdf(), quite a few copies were
                                     +#   created using up a lot of memory.  By removing unused objects and
                                     +#   writing unit names in chunks memory usage is now stable and < 200MB.
                                      # 2007-02-01 /HB
                                      # o Updated to camel case as much as possible to match JBs updates in the
                                      #   branch.

Browse code

o Added missing Rd files. o Converged the method API to the same camel case changes that James Bullard did in the devel branch for those methods that we happened to work on in parallel. o Will commit added/modified changes to the devel branch too.

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@22355 bc3139a8-67e5-0310-9ffc-ced21a209358

Henrik Bengtsson authored on 01/02/2007 19:02:01
Showing 1 changed files

R/writeCdf.private.R

History View file @ 45b8a9b

@@ -1,35 +1,25 @@
                                     -.initializeCdf <- function(con, nrows = 1, ncols = 1,
                                     -                          nunits = 1, nqcunits = 0,
                                     -                          refseq = "",
                                     -                          unitnames = rep("", nunits),
                                     -                          qcunitpositions = rep(1, nqcunits),
                                     -                          unitpositions = rep(2, nunits),
                                     -                          qcUnitLengths = rep(0, nqcunits),
                                     -                          unitLengths = rep(0, nunits),
                                     +.initializeCdf <- function(con, nRows = 1, nCols = 1,
                                     +                          nUnits = 1, nQcUnits = 0,
                                     +                          refSeq = "",
                                     +                          unitnames = rep("", nUnits),
                                     +                          qcUnitLengths = rep(0, nQcUnits),
                                     +                          unitLengths = rep(0, nUnits),
                                                                ...) {
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                          # Validate arguments
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    if(length(qcunitpositions) != nqcunits) {
                                     -      stop("Number of elements in argument 'qcunitpositions' does not match 'nqcunits'");
                                     +    if(length(qcUnitLengths) != nQcUnits) {
                                     +      stop("Number of elements in argument 'qcUnitLengths' does not match 'nQcUnits'");
+                                         }
                                     -    if(length(unitpositions) != nunits) {
                                     -      stop("Number of elements in argument 'unitpositions' does not match 'nunits'");
                                     +    if(length(unitLengths) != nUnits) {
                                     +      stop("Number of elements in argument 'qcUnitLengths' does not match 'nUnits'");
+                                         }
                                     -    if(length(qcUnitLengths) != nqcunits) {
                                     -      stop("Number of elements in argument 'qcUnitLengths' does not match 'nqcunits'");
                                     -    }
+                                    -
                                     -    if(length(unitLengths) != nunits) {
                                     -      stop("Number of elements in argument 'qcUnitLengths' does not match 'nunits'");
                                     -    }
+                                    -
                                     -    if(length(refseq) != 1)
                                     -        stop("Argument 'refseq' should be a single character.");
                                     +    if(length(refSeq) != 1)
                                     +        stop("Argument 'refSeq' should be a single character.");
                                     -    lrefseq <- nchar(refseq);
                                     +    lrefSeq <- nchar(refSeq);
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                          # CDF header
@@ -47,21 +37,21 @@
                                          ## Magic number and version number
                                          writeBin(object = as.integer(c(67, 1)),
                                                   con = con, size = 4, endian = "little")
                                     -    ## Ncols, Nrows
                                     -    writeBin(object = as.integer(c(ncols, nrows)),
                                     +    ## NCols, NRows
                                     +    writeBin(object = as.integer(c(nCols, nRows)),
                                                   con = con, size = 2, endian = "little")
                                          ## NumberUnits, NumberQCUnits
                                     -    writeBin(object = as.integer(c(nunits, nqcunits)),
                                     +    writeBin(object = as.integer(c(nUnits, nQcUnits)),
                                                   con = con, size = 4, endian = "little")
                                     -    ## Length of refseqsequence
                                     -    writeBin(object = as.integer(lrefseq),
                                     +    ## Length of refSeqsequence
                                     +    writeBin(object = as.integer(lrefSeq),
                                                   con = con, size = 4, endian = "little")
                                     -    ## Refseqsequece
                                     -    if(lrefseq > 0)
                                     -      writeChar(as.character(refseq), con=con, eos=NULL);
                                     +    ## RefSeqsequece
                                     +    if(lrefSeq > 0)
                                     +      writeChar(as.character(refSeq), con=con, eos=NULL);
                                          # Current offset
                                     -    offset <- 24 + lrefseq;
                                     +    offset <- 24 + lrefSeq;
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                          # Unit names
@@ -77,24 +67,24 @@
                                          raw[raw == as.raw(255)] <- as.raw(0);
                                          writeBin(con=con, raw);
                                          rm(raw);
                                     -#    writeChar(con=con, as.character(unitnames), nchars=rep(64, nunits), eos=NULL)
                                     +#    writeChar(con=con, as.character(unitnames), nchars=rep(64, nUnits), eos=NULL)
                                     -    bytesOfUnitNames <- 64 * nunits;
                                     +    bytesOfUnitNames <- 64 * nUnits;
                                          offset <- offset + bytesOfUnitNames;
                                     -    bytesOfQcUnits <- 4 * nqcunits;
                                     +    bytesOfQcUnits <- 4 * nQcUnits;
                                          offset <- offset + bytesOfQcUnits;
                                     -    bytesOfUnits <- 4 * nunits;
                                     +    bytesOfUnits <- 4 * nUnits;
                                          offset <- offset + bytesOfUnits;
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                          # QC units file positions
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     -    if (nqcunits > 0) {
                                     +    if (nQcUnits > 0) {
                                            csum <- cumsum(qcUnitLengths);
                                     -      nextOffset <- csum[nqcunits];
                                     -      starts <- c(0, csum[-nqcunits]);
                                     +      nextOffset <- csum[nQcUnits];
                                     +      starts <- c(0, csum[-nQcUnits]);
                                            starts <- as.integer(offset + starts);
                                            writeBin(starts, con = con, size = 4, endian = "little")
                                          } else {
@@ -108,10 +98,10 @@
                                          # Units file positions
                                          # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                          offset <- offset + nextOffset;
                                     -    if (nunits > 0) {
                                     +    if (nUnits > 0) {
                                            csum <- cumsum(unitLengths);
                                     -      nextOffset <- csum[nunits];
                                     -      starts <- c(0, csum[-nunits]);
                                     +      nextOffset <- csum[nUnits];
                                     +      starts <- c(0, csum[-nUnits]);
                                            starts <- as.integer(offset + starts);
                                            writeBin(starts, con = con, size = 4, endian = "little");
                                          } else {
@@ -124,23 +114,23 @@
                                          ## 3. Write the unit
                                          unitTypes <- c(expression=1, genotyping=2, tag=3,
                                                                                   resequencing=4, unknown=5);
                                     -    unittype <- unitTypes[unit$unittype];
                                     +    unitType <- unitTypes[unit$unittype];
                                          unitDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                     -    unitdirection <- unitDirections[unit$unitdirection];
                                     +    unitDirection <- unitDirections[unit$unitdirection];
                                     -    unittype <- switch(unit$unittype,
                                     +    unitType <- switch(unit$unittype,
                                                             expression = 1,
                                                             genotyping = 2,
                                                             tag = 3,
                                                             resequencing = 4,
                                                             unknown = 5)
                                     -    unitdirection <- switch(unit$unitdirection,
                                     +    unitDirection <- switch(unit$unitdirection,
                                                                  nodirection = 0,
                                                                  sense = 1,
                                                                  antisense = 2,
                                                                  unknown = 3)
                                     -    unitInfo <- as.integer(c(unittype, unitdirection,
                                     +    unitInfo <- as.integer(c(unitType, unitDirection,
                                                                   unit$natoms, length(unit$groups),
                                                                   unit$ncells, unit$unitnumber,
                                                                   unit$ncellsperatom))
@@ -160,15 +150,15 @@
                                          groupDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                          for(igroup in seq(along.with = unit$groups)) {
                                              group <- unit$groups[[igroup]]
                                     -        groupdirection <- groupDirections[group$groupdirection];
                                     -        groupdirection <- switch(group$groupdirection,
                                     +        groupDirection <- groupDirections[group$groupdirection];
                                     +        groupDirection <- switch(group$groupdirection,
                                                                       nodirection = 0,
                                                                       sense = 1,
                                                                       antisense = 2,
                                                                       unknown = 3)
                                              groupInfo <- as.integer(c(group$natoms, length(group$x),
                                                                        group$ncellsperatom,
                                     -                                  groupdirection, min(group$atoms, 0)))
                                     +                                  groupDirection, min(group$atoms, 0)))
                                             # Number of bytes: 2*4+2*1+2*4=18 bytes
                                              writeBin(groupInfo[1:2],
                                                       con = con, size = 4, endian = "little")
@@ -239,8 +229,8 @@
                                          writeBin(qcunitInfo[2], con = con, size = 4, endian = "little")
                                          # Write 2 + 4 bytes
                                     -    ncells <- length(qcunit$x);
                                     -    nbrOfBytes <- 7*ncells;
                                     +    nCells <- length(qcunit$x);
                                     +    nbrOfBytes <- 7*nCells;
                                          cells <- matrix(as.integer(c(qcunit$x, qcunit$y, qcunit$length,
                                                                       qcunit$pm, qcunit$background)),
                                                          ncol = 5)
@@ -253,6 +243,11 @@
                                      ############################################################################
                                      # HISTORY:
                                     +# 2007-02-01 /HB
                                     +# o Updated to camel case as much as possible to match JBs updates in the
                                     +#   branch.
                                     +# o Removed non-used arguments 'unitpositions' and 'qcunitpositions' from
                                     +#   .initializeCdf().
                                      # 2007-01-10 /HB
                                      # o Added writeCdfHeader(), writeCdfQcUnits() and writeCdfUnits().  With
                                      #   these it is now possible to build up the CDF in chunks.
@@ -267,13 +262,13 @@
                                      #   with other code, pursuant to communication from KH.
                                      # 2006-10-25 /HB (+KS)
                                      # o BUG FIX: .initializeCdf() was writing false file offset for QC units
                                     -#   when the number QC nunits were zero.  This would core dump readCdfNnn().
                                     +#   when the number QC nUnits were zero.  This would core dump readCdfNnn().
                                      # 2006-09-21 /HB
                                      # o BUG FIX: The 'atom' and 'indexpos' fields were swapped.
                                      # o Now suppressing warnings "writeChar: more characters requested..." in
                                      #   writeCdf().
                                      # 2006-09-11 /HB
                                     -# o BUG FIX: nrows & ncols were swapped in the CDF header.
                                     +# o BUG FIX: nRows & nCols were swapped in the CDF header.
                                      # 2006-09-09 /HB
                                      # o Updated writeCdf() has been validate with compareCdfs() on a few arrays.
                                      # o With the below "optimizations" writeCdf() now writes Hu6800.CDF with

Browse code

o Added writeCdfHeader(), writeCdfQcUnits() and writeCdfUnits(). These are all used by writeCdf(). They also make it possible to write a CDF in chunks in order to for instance convertCdf() in constant memory. These functions still need to be documented.

git-svn-id: file:///home/git/hedgehog.fhcrc.org/bioconductor/trunk/madman/Rpacks/affxparser@21888 bc3139a8-67e5-0310-9ffc-ced21a209358

Henrik Bengtsson authored on 09/01/2007 08:44:30
Showing 1 changed files

R/writeCdf.private.R

History View file @ 1b71034

                                     new file mode 100644
@@ -0,0 +1,295 @@
                                     +.initializeCdf <- function(con, nrows = 1, ncols = 1,
                                     +                          nunits = 1, nqcunits = 0,
                                     +                          refseq = "",
                                     +                          unitnames = rep("", nunits),
                                     +                          qcunitpositions = rep(1, nqcunits),
                                     +                          unitpositions = rep(2, nunits),
                                     +                          qcUnitLengths = rep(0, nqcunits),
                                     +                          unitLengths = rep(0, nunits),
                                     +                          ...) {
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    # Validate arguments
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    if(length(qcunitpositions) != nqcunits) {
                                     +      stop("Number of elements in argument 'qcunitpositions' does not match 'nqcunits'");
                                     +    }
+                                    +
                                     +    if(length(unitpositions) != nunits) {
                                     +      stop("Number of elements in argument 'unitpositions' does not match 'nunits'");
                                     +    }
+                                    +
                                     +    if(length(qcUnitLengths) != nqcunits) {
                                     +      stop("Number of elements in argument 'qcUnitLengths' does not match 'nqcunits'");
                                     +    }
+                                    +
                                     +    if(length(unitLengths) != nunits) {
                                     +      stop("Number of elements in argument 'qcUnitLengths' does not match 'nunits'");
                                     +    }
+                                    +
                                     +    if(length(refseq) != 1)
                                     +        stop("Argument 'refseq' should be a single character.");
+                                    +
                                     +    lrefseq <- nchar(refseq);
+                                    +
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    # CDF header
                                     +    #
                                     +    # 1 Magic number. Always set to 67.                           [integer]
                                     +    # 2 Version number.                                           [integer]
                                     +    # 3 The number of columns of cells on the array.       [unsigned short]
                                     +    # 4 The number of rows of cells on the array.          [unsigned short]
                                     +    # 5 The number of units in the array not including QC units. The term
                                     +    #   unit is an internal term which means probe set.           [integer]
                                     +    # 6 The number of QC units.                                   [integer]
                                     +    # 7 The length of the resequencing reference sequence.        [integer]
                                     +    # 8 The resequencing reference sequence.                    [char[len]]
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    ## Magic number and version number
                                     +    writeBin(object = as.integer(c(67, 1)),
                                     +             con = con, size = 4, endian = "little")
                                     +    ## Ncols, Nrows
                                     +    writeBin(object = as.integer(c(ncols, nrows)),
                                     +             con = con, size = 2, endian = "little")
                                     +    ## NumberUnits, NumberQCUnits
                                     +    writeBin(object = as.integer(c(nunits, nqcunits)),
                                     +             con = con, size = 4, endian = "little")
                                     +    ## Length of refseqsequence
                                     +    writeBin(object = as.integer(lrefseq),
                                     +             con = con, size = 4, endian = "little")
                                     +    ## Refseqsequece
                                     +    if(lrefseq > 0)
                                     +      writeChar(as.character(refseq), con=con, eos=NULL);
+                                    +
                                     +    # Current offset
                                     +    offset <- 24 + lrefseq;
+                                    +
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    # Unit names
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    # Write to raw vector (2*10^6 units => 122Mb; should be ok for now)
                                     +    # Since we can't create strings with '\0':s, we use '\xFF',
                                     +    # write to raw and then replace '\xFF' with '\0'. Thus, unit names with
                                     +    # '\xFF' are invalid, but this should not be a real problem.
                                     +    pads <- sapply(0:64, FUN=function(x) paste(rep("\xFF", x), collapse=""));
                                     +    unitnames <- paste(unitnames, pads[64-nchar(unitnames)], sep="");
                                     +    raw <- raw(64*length(unitnames));
                                     +    raw <- writeBin(con=raw, unitnames, size=1);
                                     +    raw[raw == as.raw(255)] <- as.raw(0);
                                     +    writeBin(con=con, raw);
                                     +    rm(raw);
                                     +#    writeChar(con=con, as.character(unitnames), nchars=rep(64, nunits), eos=NULL)
+                                    +
                                     +    bytesOfUnitNames <- 64 * nunits;
                                     +    offset <- offset + bytesOfUnitNames;
+                                    +
                                     +    bytesOfQcUnits <- 4 * nqcunits;
                                     +    offset <- offset + bytesOfQcUnits;
+                                    +
                                     +    bytesOfUnits <- 4 * nunits;
                                     +    offset <- offset + bytesOfUnits;
+                                    +
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    # QC units file positions
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    if (nqcunits > 0) {
                                     +      csum <- cumsum(qcUnitLengths);
                                     +      nextOffset <- csum[nqcunits];
                                     +      starts <- c(0, csum[-nqcunits]);
                                     +      starts <- as.integer(offset + starts);
                                     +      writeBin(starts, con = con, size = 4, endian = "little")
                                     +    } else {
                                     +      nextOffset <- 0;
                                     +#      starts <- 0;
                                     +#      starts <- as.integer(offset + starts);
                                     +#      writeBin(starts, con = con, size = 4, endian = "little")
                                     +    }
+                                    +
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    # Units file positions
                                     +    # - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
                                     +    offset <- offset + nextOffset;
                                     +    if (nunits > 0) {
                                     +      csum <- cumsum(unitLengths);
                                     +      nextOffset <- csum[nunits];
                                     +      starts <- c(0, csum[-nunits]);
                                     +      starts <- as.integer(offset + starts);
                                     +      writeBin(starts, con = con, size = 4, endian = "little");
                                     +    } else {
                                     +      nextOffset <- 0;
                                     +    }
                                     +} # .initializeCdf()
+                                    +
+                                    +
                                     +.writeCdfUnit <- function(unit, con, unitname=NULL) {
                                     +    ## 3. Write the unit
                                     +    unitTypes <- c(expression=1, genotyping=2, tag=3,
                                     +                                             resequencing=4, unknown=5);
                                     +    unittype <- unitTypes[unit$unittype];
                                     +    unitDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                     +    unitdirection <- unitDirections[unit$unitdirection];
+                                    +
                                     +    unittype <- switch(unit$unittype,
                                     +                       expression = 1,
                                     +                       genotyping = 2,
                                     +                       tag = 3,
                                     +                       resequencing = 4,
                                     +                       unknown = 5)
                                     +    unitdirection <- switch(unit$unitdirection,
                                     +                            nodirection = 0,
                                     +                            sense = 1,
                                     +                            antisense = 2,
                                     +                            unknown = 3)
+                                    +
                                     +    unitInfo <- as.integer(c(unittype, unitdirection,
                                     +                             unit$natoms, length(unit$groups),
                                     +                             unit$ncells, unit$unitnumber,
                                     +                             unit$ncellsperatom))
+                                    +
                                     +    # Number of bytes: 2+1+4*4+1=20 bytes
                                     +    writeBin(unitInfo[1],
                                     +             con = con, size = 2, endian = "little")
                                     +    writeBin(unitInfo[2],
                                     +             con = con, size = 1, endian = "little")
                                     +    writeBin(unitInfo[3:6],
                                     +             con = con, size = 4, endian = "little")
                                     +    writeBin(unitInfo[7],
                                     +             con = con, size = 1, endian = "little")
+                                    +
                                     +    ## Writing each group in turn
                                     +    # Number of bytes: (18+64)*nbrOfGroups + 14*totalNbrOfCells bytes
                                     +    groupDirections <- c(nodirection=0, sense=1, antisense=2, unknown=3);
                                     +    for(igroup in seq(along.with = unit$groups)) {
                                     +        group <- unit$groups[[igroup]]
                                     +        groupdirection <- groupDirections[group$groupdirection];
                                     +        groupdirection <- switch(group$groupdirection,
                                     +                                 nodirection = 0,
                                     +                                 sense = 1,
                                     +                                 antisense = 2,
                                     +                                 unknown = 3)
                                     +        groupInfo <- as.integer(c(group$natoms, length(group$x),
                                     +                                  group$ncellsperatom,
                                     +                                  groupdirection, min(group$atoms, 0)))
                                     +       # Number of bytes: 2*4+2*1+2*4=18 bytes
                                     +        writeBin(groupInfo[1:2],
                                     +                 con = con, size = 4, endian = "little")
                                     +        writeBin(groupInfo[3:4],
                                     +                 con = con, size = 1, endian = "little")
                                     +        writeBin(groupInfo[5:6],
                                     +                 con = con, size = 4, endian = "little")
+                                    +
                                     +        # Number of bytes: 64 bytes
                                     +        suppressWarnings({
                                     +          writeChar(as.character(names(unit$groups)[igroup]),
                                     +                    con = con, nchars = 64, eos = NULL)
                                     +        })
+                                    +
                                     +        ## Writing each cell in turn
                                     +#        cells <- matrix(as.integer(c(group$atom, group$x,
                                     +#                                     group$y, group$indexpos)),
                                     +#                        ncol = 4)
                                     +        cells <- matrix(as.integer(c(group$indexpos, group$x,
                                     +                                     group$y, group$atom)),
                                     +                        ncol = 4)
+                                    +
                                     +        # Number of bytes: 14*nbrOfCells bytes
                                     +        for(icell in seq(along.with = group$x)) {
                                     +            # Number of bytes: 1*4+2*2+1*4+1*2=14 bytes
                                     +            writeBin(cells[icell, 1],
                                     +                     con = con, size = 4, endian = "little")
                                     +            writeBin(cells[icell, 2:3],
                                     +                     con = con, size = 2, endian = "little")
                                     +            writeBin(cells[icell, 4],
                                     +                     con = con, size = 4, endian = "little")
                                     +            writeChar(as.character(c(group$pbase[icell],
                                     +                                     group$tbase[icell])),
                                     +                      con = con, nchars = c(1,1), eos = NULL)
                                     +        }
                                     +    }
                                     +} # .writeCdfUnit()
+                                    +
+                                    +
+                                    +
                                     +.writeCdfQcUnit <- function(qcunit, con) {
                                     +    ## 2. Actually write the qcunit
                                     +    type <- switch(qcunit$type,
                                     +                   unknown = 0,
                                     +                   checkerboardNegative = 1,
                                     +                   checkerboardPositive = 2,
                                     +                   hybeNegative = 3,
                                     +                   hybePositive = 4,
                                     +                   textFeaturesNegative = 5,
                                     +                   textFeaturesPositive = 6,
                                     +                   centralNegative = 7,
                                     +                   centralPositive = 8,
                                     +                   geneExpNegative = 9,
                                     +                   geneExpPositive = 10,
                                     +                   cycleFidelityNegative = 11,
                                     +                   cycleFidelityPositive = 12,
                                     +                   centralCrossNegative = 13,
                                     +                   centralCrossPositive = 14,
                                     +                   crossHybeNegative = 15,
                                     +                   crossHybePositive = 16,
                                     +                   SpatialNormNegative = 17,
                                     +                   SpatialNormPositive = 18)
+                                    +
                                     +    # Write 2 + 4 bytes
                                     +    nbrOfBytes <- 6;
                                     +    qcunitInfo <- as.integer(c(type, qcunit$ncells))
                                     +    writeBin(qcunitInfo[1], con = con, size = 2, endian = "little")
                                     +    writeBin(qcunitInfo[2], con = con, size = 4, endian = "little")
+                                    +
                                     +    # Write 2 + 4 bytes
                                     +    ncells <- length(qcunit$x);
                                     +    nbrOfBytes <- 7*ncells;
                                     +    cells <- matrix(as.integer(c(qcunit$x, qcunit$y, qcunit$length,
                                     +                                 qcunit$pm, qcunit$background)),
                                     +                    ncol = 5)
                                     +    for(icell in seq(along.with = qcunit$x)) {
                                     +        writeBin(cells[icell, 1:2], con = con, size = 2, endian = "little")
                                     +        writeBin(cells[icell, 3:5], con = con, size = 1, endian = "little")
                                     +    }
                                     +} # .writeCdfQcUnit()
+                                    +
+                                    +
                                     +############################################################################
                                     +# HISTORY:
                                     +# 2007-01-10 /HB
                                     +# o Added writeCdfHeader(), writeCdfQcUnits() and writeCdfUnits().  With
                                     +#   these it is now possible to build up the CDF in chunks.
                                     +# o Removed obsolete arguments 'addName' and 'addPositions' and all related
                                     +#   code.  Internal variable 'positions' is not needed anymore.
                                     +#   There are no more seek():s in the code.
                                     +# o Removed obsolete .writeCdfUnit2().
                                     +# o Now only every 1000th unit (instead of 100th) is reported. It is now
                                     +#   also a count down.
                                     +# 2006-12-18 /KS
                                     +# o Make global replacement "block" -> "group" to maintain consistency
                                     +#   with other code, pursuant to communication from KH.
                                     +# 2006-10-25 /HB (+KS)
                                     +# o BUG FIX: .initializeCdf() was writing false file offset for QC units
                                     +#   when the number QC nunits were zero.  This would core dump readCdfNnn().
                                     +# 2006-09-21 /HB
                                     +# o BUG FIX: The 'atom' and 'indexpos' fields were swapped.
                                     +# o Now suppressing warnings "writeChar: more characters requested..." in
                                     +#   writeCdf().
                                     +# 2006-09-11 /HB
                                     +# o BUG FIX: nrows & ncols were swapped in the CDF header.
                                     +# 2006-09-09 /HB
                                     +# o Updated writeCdf() has been validate with compareCdfs() on a few arrays.
                                     +# o With the below "optimizations" writeCdf() now writes Hu6800.CDF with
                                     +#   units in 130s compared to 140s.
                                     +# o Now initializeCdf() dumps all unit names at once by first building a
                                     +#   raw vector.  This is now much faster than before.
                                     +# o Now writeCdf() does not seek() around in the file anymore.  This should
                                     +#   speed up writing at least a bit.
                                     +# o Made some optimization, which speeds up the writing a bit.  Jumping
                                     +#   around in the file with seek() is expensive and should be avoided.
                                     +# o Rename writeUnit() to writeCdfUnit() and same for the QC function.
                                     +# o Added more verbose output and better errror messages for writeCdf().
                                     +# 2006-09-07 /HB
                                     +# o Maybe initalizeCdf(), writeUnit(), and writeQcUnit() should be made
                                     +#   private functions of this package.
                                     +# o Removed textCdf2binCdf() skeleton. See convertCdf() instead.
                                     +# o Updated writeCdf() such that the connection is guaranteed to be closed
                                     +#   regardless.
                                     +############################################################################