Help : Help Entering Results for a Single Array
Contents
The
Data Entry for
Microarray Experiment form is used to enter experiments into the
database one at a time. This help describes how to fill out the form
to enter an experiment. Experiments can also be entered by batch, and
a separate
batch help is
available for that procedure.
It is assumed you've already read First Time Users and What You Need from the
previous help file, Entering Results
from Experiments
A "Result Set" is one version of the data for a given hybridization.
The Agilent and Affymetrix/dChip software will allow you to set
various parameters, producing somewhat different numbers. That is,
you may scan a slide once (defining a "slide" and "experiment" for the
purposes of the database), but choose several different ways to
normalize or compute the extracted data, producing multiple result
sets for the single slide. Additionally, the Affymetrix cell file is
considered a separate result set for Affymetrix arrays (you should
have exactly one cell file result set, and one or more data file
result sets, per Affymetrix array).
- Enter the name for the result set and a description specific to
the set; e.g., details of the normalization, or the software used
(say, MAS 5 vs. GCOS vs. dChip). Information describing the
experiment or encompassing all result sets (e.g., for Affymetrix, the
cell data and every Probe Set intensity set) is entered in the
Experiment Description and Details section.
- Determine the print your experiment uses and select it from the
pop-up list. Clicking on Print
Name will get you information about these prints. If you are
unsure about which print to use, contact the microarray database
curators.
- Enter a unique name for the slide used in this experiment in the
Slide Name box. A systematic slide name, usually dictated by the array
producer, is preferred (unique).
Note: All text entry on the forms is case
sensitive. Also, please avoid the use of single and double quotes
('a' and "a") in your entries.
- Select the files from the pull-down menu containing data for
your experiment. These files must exist in
the Stanford Department of Genetics UNIX system. To transfer files
from a PC to this system, ftp to loader.stanford.edu (you must already
have an account on loader). After you have logged into your directory,
go to your incoming directory and place your data files there. The incoming directory in your own account on loader is the only place from which you can load data.
Please make sure that you transfer your .gpr, .dat, .srr, .CEL, .xls, and .txt files as
ASCII text and the .gps, .sag, .sra, .tif, and Affymetrix .DAT files as
binary or "raw" data. Remember that we now accept compressed (.gz) files. For more information about transferring files, refer to the Entering Results from Experiments document.
Experiment Description And Details Section
- Enter a date you want associated with the experiment; the date of
hybridization is recommended. The date format is 4-digit year, 2-digit month, and 2-digit day (YYYY-MM-DD).
- For Experiment Name, enter a unique, descriptive name for your
experiment. The maximum length is 100 characters.
- You have space to enter 2000 characters of text description for
experiment. This is optional and is intended for useful notes, such as a
description of the methods used in sample preparation. There are
separate description fields below to specifically identify the red and green channels.
- Choose an experiment category and subcategory. If you need
more information about categories
or subcategories,
click on the links to the left of the pop-up menus. If you need new
categories or subcategories created, contact the microarray database curators.
- Enter short descriptions of the samples used for the green and red channels in this experiment. The maximum length is 100 characters, and descriptions are usually phrases such as "reference pool", or "nitrogen starvation, 2 days".
- If you wish to define your own normalization, check 'User
Defined' and enter a normalization value in the box provided. Otherwise,
accept the default-computed
normalization ('Computed' on the form) or select 'Using regression
correlation' if you wish to use this method. Normalization type is
required for entering Agilent Affymetrix data, but is ignored.
Array Access Section
- Enter the login name of the person who
performed the experiment in the Experimenter box. This will be
the only person, save the curators, with the ability to edit or delete
the array. By default, your entire lab group will automatically have view access to your experiment.
- Choose who will be able to see the data from your array using the
multiple-select scrolling lists under Collaborative Group. You may
select one or more Collaborative Groups and/or one or more individual
users who will be able to view your data. (To select multiple choices
in a list, hold down the command/apple key on MacOS or the control key
on a Windows while you click. With Unix, just click on multiple
names.) Lists of collaborative
groups and individual users
can be obtained by clicking the links to the left of the scrolling
menus or by clicking "User group" or "Users" under "List Data" on the
main page.
- Click the "Load Experiment into Database" button to enter your
experiment.
Monitoring Your Request as it Progresses Within the Queue
Experiment loading is commenced by entering your loading-data into a
queue. The rate of loading is determined by a number of factors,
including both the load on the database and how many other array-load
requests were made prior to yours. If there are no delays, it usually
takes at least five minutes per array, and may take
quite a bit longer if your arrays have a large number of spots (human arrays) or
if many other users are using the database.
During this time, you can check the progress of your experiment load
within the queue. After your data is successfully entered into the
queue (Note: this is not the same thing as final entry into the
database), you should receive a confirmation screen as well as an
email notifying you:
Your database entry request (batch number XXXX) has been
queued for loading.
Please note the data for your array(s) ARE NOT YET IN THE
DATABASE. Do NOT delete any of your files until you receive email
confirmation that the data have been loaded.
Progress of is batch within the queue can be viewed at:
http://genome-www5.stanford.edu/cgi-bin/tools/queue/nph-ProgressQuery?batchno=XXXX
You may also view this file in your loader account under the ORA-OUT directory.
If you have any questions please contact the curators
array@genome.stanford.edu)
You can check the progress
of your experiment load based on the batch number reported to you
with either the link on the queue confirmation page or from the URL in
the email.
Successful Result Entry into the Database
If all goes well, you will eventually get an email
message that says:
Loading of your array data (batch number XXXX), has
successfully completed. 1 out of 1 were successfully loaded.
Details of the load process have been written to :
/loader/ftphome/username/ORA-OUT/XXXX.log,
or you can temporarily view the details via the web at:
http://genome-www5.stanford.edu/cgi-bin/tools/queue/nph-ProgressQuery?batchno=XXXX
If you have any questions please contact the curators
(array@genome.stanford.edu).
At the bottom of the HTML confirmation page or in the log file in your ORA-OUT directory on loader.stanford.edu should be the message:
***All data for this experiment ('slidename') have been successfully inserted into oracle database***
Common Problems
- If your results have not been loaded 1 day after entry into the
queue, please notify the microarray database curators.
- File location: All files must be in the incoming directory on your account for loader.stanford.edu.
- UNIX file names: The names of your uploaded files
should not
contain spaces, or any of
the following characters:
'
"
#
,
/
\
?
<
>
;
:
!
@
%
^
&
*
(
)
- Occasionally, we backup and re-index the database. This process
can significantly the delay the loading of data (and vice versa). We
suggest not loading during these time periods. Consult the Scheduled
Database Backups page for the times to avoid.
- Sometimes .tif -> .gif conversion fails. Please check your loaded
arrays by displaying them and verifying the clickable-gif. If you need to replace the gif file that we have created for you, please see our help documentation for this. If their
is no clickable-gif icon present, contact the microarray database curators.
- Errors? What errors? Shortly after a queue batch request
is processed (successful or not), you will no longer be able
monitor its status within the queue (as it has been removed,
and its web-log with it). However, just check your ORA-OUT directory on loader.stanford.edu to see the text log-file of the database entry.
Please send comments or questions to:
array@genome.stanford.edu