@@ -11,8 +11,28 @@ A graphical user interface (GUI) is provided to allow the user to select the inp
11
11
The results browser displays the protein sequences, highlighting the residues that were present
12
12
in the peptide input file, and providing sequence coverage stats for each protein.
13
13
14
+ ## Example files:
15
+
16
+ Input files:
17
+ * [ BSA_P171_QID1638_TestPeptides.txt] ( https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides.txt )
18
+ * [ BSA_P171_QID1638_TestPeptides_SequenceOnly.txt] ( https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_SequenceOnly.txt )
19
+ * [ BSA_P171_QID1638_TestProteins.fasta] ( https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestProteins.fasta )
20
+
21
+ Output files:
22
+ * [ BSA_P171_QID1638_TestPeptides_coverage.txt] ( https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_coverage.txt )
23
+ * [ BSA_P171_QID1638_TestPeptides_ProteinToPeptideMapping.txt] ( https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_ProteinToPeptideMapping.txt )
24
+ * [ BSA_P171_QID1638_TestPeptides_SequenceOnly_coverage.txt] ( https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_SequenceOnly_coverage.txt )
25
+ * [ BSA_P171_QID1638_TestPeptides_SequenceOnly_ProteinToPeptideMapping.txt] ( https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_SequenceOnly_ProteinToPeptideMapping.txt )
26
+
27
+
14
28
## Console Version Syntax
15
29
30
+ There are two options for computing protein coverage and peptide to protein mapping from the command line:
31
+ * ProteinCoverageSummarizerGUI.exe
32
+ * PeptideToProteinMapper.exe
33
+
34
+ ### Console Arguments for ProteinCoverageSummarizerGUI.exe
35
+
16
36
```
17
37
ProteinCoverageSummarizerGUI.exe
18
38
/I:PeptideInputFilePath /R:ProteinInputFilePath [/O:OutputDirectoryName]
@@ -21,7 +41,8 @@ ProteinCoverageSummarizerGUI.exe
21
41
22
42
The input file path can contain the wildcard character * . If a wildcard is
23
43
present, the same protein input file path will be used for each of the peptide
24
- input files matched.
44
+ input files matched. If the input file has one column of data, it is assumed to be Peptide Sequence.
45
+ If two columns of data (separated by a tab character), it is assumed to be Protein Name then Peptide Sequence.
25
46
26
47
The output directory name is optional. If omitted, the output files will be created
27
48
in the same directory as the input file. If included, a subdirectory is created with
@@ -30,17 +51,72 @@ the name OutputDirectoryName.
30
51
The parameter file path is optional. If included, it should point to a valid XML
31
52
parameter file.
32
53
33
- Use /G to ignore I/L differences when finding peptides in proteins or computing coverage.
54
+ Use ` /G ` to ignore I/L differences when finding peptides in proteins or computing coverage.
55
+
56
+ Use ` /H ` to suppress (hide) the protein sequence in the _ coverage.txt file.
57
+
58
+ Use ` /M ` to enable the creation of a protein to peptide mapping file.
59
+
60
+ Use ` /K ` to skip protein coverage computation steps
61
+
62
+ Use ` /Debug ` to keep the console open to see additional debug messages
63
+
64
+ Use ` /KeepDB ` to keep the SQLite database after processing (by default it is deleted)
65
+
66
+
67
+ ### Console Arguments for PeptideToProteinMapper.exe
68
+
69
+ The PeptideToProteinMapper reads in a text file containing peptide sequences then
70
+ searches the specified .fasta or text file containing protein names and sequences
71
+ to find the proteins that contain each peptide. The program will also compute
72
+ the sequence coverage percent for each protein (though this can be disable using /K).
73
+
74
+ ```
75
+ PeptideToProteinMapper.exe
76
+ /I:PeptideInputFilePath /R:ProteinInputFilePath
77
+ [/O:OutputDirectoryName] [/P:ParameterFilePath] [/F:FileFormatCode]
78
+ [/N:InspectParameterFilePath] [/G] [/H] [/K] [/A]
79
+ [/L[:LogFilePath]] [/LogDir:LogDirectoryPath] [/VerboseLog] [/Q]
80
+ ```
81
+
82
+ The input file path can contain the wildcard character * . If a wildcard is
83
+ present, the same protein input file path will be used for each of the peptide
84
+ input files matched.
85
+
86
+ The output directory name is optional. If omitted, the output files will be
87
+ created in the same directory as the input file. If included, then a subdirectory
88
+ is created with the name OutputDirectoryName.
89
+
90
+ The parameter file path is optional. If included, it should point to a valid XML
91
+ parameter file.
92
+
93
+ Use ` /F ` to specify the peptide input file format code. Options are:
94
+ * 0=Auto Determine: Treated as ` /F:1 ` unless name ends in _ inspect.txt, then ` /F:3 `
95
+ * 1=Peptide sequence in the 1st column (subsequent columns are ignored)
96
+ * 2=Protein name in 1st column and peptide sequence 2nd column
97
+ * 3=Inspect search results file (peptide sequence in the 3rd column)
98
+ * 4=MS-GF+ search results file (peptide sequence in the column titled 'Peptide'; optionally scan number in the column titled 'Scan')
99
+ * 5=SEQUEST, X!Tandem, Inspect, or MS-GF+ PHRP data file
100
+
101
+ When processing an Inspect search results file, use ` /N ` to specify the Inspect
102
+ parameter file used (required for determining the mod names embedded in the
103
+ identified peptides).
104
+
105
+ Use ` /G ` to ignore I/L differences when finding peptides in proteins or computing coverage
106
+
107
+ Use ` /H ` to suppress (hide) the protein sequence in the _ coverage.txt file
34
108
35
- Use /H to suppress (hide) the protein sequence in the _ coverage.txt file.
109
+ Use ` /K ` to skip the protein coverage computation steps (enabling faster processing)
36
110
37
- Use /M to enable the creation of a protein to peptide mapping file.
111
+ Use ` /A ` to create a copy of the source file, but with a new column listing the
112
+ mapped protein for each peptide. If a peptide maps to multiple proteins, then
113
+ multiple lines will be listed
38
114
39
- Use /K to skip protein coverage computation steps
115
+ Use ` /L ` to create a log file, optionally specifying the file name
40
116
41
- Use /Debug to keep the console open to see additional debug messages
117
+ Use ` /LogDir ` to define the directory in which the log file should be created
42
118
43
- Use /KeepDB to keep the SQLite database after processing (by default it is deleted)
119
+ Use ` /VerboseLog ` to create a detailed log file
44
120
45
121
## Contacts
46
122
0 commit comments