Skip to content

Commit d836f0a

Browse files
committed
Update the Readme files and Revision History
1 parent d36ba68 commit d836f0a

File tree

3 files changed

+106
-24
lines changed

3 files changed

+106
-24
lines changed

PeptideToProteinMapper/bin/README.md

Lines changed: 20 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -12,44 +12,47 @@ PeptideToProteinMapper.exe
1212
/I:PeptideInputFilePath /R:ProteinInputFilePath
1313
[/O:OutputDirectoryName] [/P:ParameterFilePath] [/F:FileFormatCode]
1414
[/N:InspectParameterFilePath] [/G] [/H] [/K] [/A]
15-
[/L[:LogFilePath]] [/LogDirectory:LogDirectoryPath] [/VerboseLog] [/Q]
15+
[/L[:LogFilePath]] [/LogDir:LogDirectoryPath] [/VerboseLog] [/Q]
1616
```
1717

1818
The input file path can contain the wildcard character *. If a wildcard is
1919
present, the same protein input file path will be used for each of the peptide
2020
input files matched.
2121

22-
The output directory name is optional. If omitted, the output files will be created
23-
in the same directory as the input file. If included, then a subdirectory is created
24-
with the name OutputDirectoryName.
22+
The output directory name is optional. If omitted, the output files will be
23+
created in the same directory as the input file. If included, then a subdirectory
24+
is created with the name OutputDirectoryName.
2525

26-
The parameter file path is optional. If included, it should point to a valid XML parameter file.
26+
The parameter file path is optional. If included, it should point to a valid XML
27+
parameter file.
2728

28-
Use /F to specify the peptide input file format code. Options are:
29-
* 0=Auto Determine: Treated as /F:1 unless name ends in _inspect.txt, then /F:3
29+
Use `/F` to specify the peptide input file format code. Options are:
30+
* 0=Auto Determine: Treated as `/F:1` unless name ends in _inspect.txt, then `/F:3`
3031
* 1=Peptide sequence in the 1st column (subsequent columns are ignored)
3132
* 2=Protein name in 1st column and peptide sequence 2nd column
3233
* 3=Inspect search results file (peptide sequence in the 3rd column)
3334
* 4=MS-GF+ search results file (peptide sequence in the column titled 'Peptide'; optionally scan number in the column titled 'Scan')
3435
* 5=SEQUEST, X!Tandem, Inspect, or MS-GF+ PHRP data file
3536

36-
When processing an Inspect search results file, use /N to specify the Inspect parameter file used
37-
(required for determining the mod names embedded in the identified peptides).
37+
When processing an Inspect search results file, use `/N` to specify the Inspect
38+
parameter file used (required for determining the mod names embedded in the
39+
identified peptides).
3840

39-
Use /G to ignore I/L differences when finding peptides in proteins or computing coverage
41+
Use `/G` to ignore I/L differences when finding peptides in proteins or computing coverage
4042

41-
Use /H to suppress (hide) the protein sequence in the _coverage.txt file
43+
Use `/H` to suppress (hide) the protein sequence in the _coverage.txt file
4244

43-
Use /K to skip the protein coverage computation steps (enabling faster processing)
45+
Use `/K` to skip the protein coverage computation steps (enabling faster processing)
4446

45-
Use /A to create a copy of the source file, but with a new column listing the mapped protein for each peptide.
46-
If a peptide maps to multiple proteins, then multiple lines will be listed.
47+
Use `/A` to create a copy of the source file, but with a new column listing the
48+
mapped protein for each peptide. If a peptide maps to multiple proteins, then
49+
multiple lines will be listed
4750

48-
Use /L to create a log file, optionally specifying the file name
51+
Use `/L` to create a log file, optionally specifying the file name
4952

50-
Use /LogDirectory to define the directory in which the log file should be created
53+
Use `/LogDir` to define the directory in which the log file should be created
5154

52-
Use /VerboseLog to create a detailed log file
55+
Use `/VerboseLog` to create a detailed log file
5356

5457
## Contacts
5558

README.md

Lines changed: 83 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -11,8 +11,28 @@ A graphical user interface (GUI) is provided to allow the user to select the inp
1111
The results browser displays the protein sequences, highlighting the residues that were present
1212
in the peptide input file, and providing sequence coverage stats for each protein.
1313

14+
## Example files:
15+
16+
Input files:
17+
* [BSA_P171_QID1638_TestPeptides.txt](https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides.txt)
18+
* [BSA_P171_QID1638_TestPeptides_SequenceOnly.txt](https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_SequenceOnly.txt)
19+
* [BSA_P171_QID1638_TestProteins.fasta](https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestProteins.fasta)
20+
21+
Output files:
22+
* [BSA_P171_QID1638_TestPeptides_coverage.txt](https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_coverage.txt)
23+
* [BSA_P171_QID1638_TestPeptides_ProteinToPeptideMapping.txt](https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_ProteinToPeptideMapping.txt)
24+
* [BSA_P171_QID1638_TestPeptides_SequenceOnly_coverage.txt](https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_SequenceOnly_coverage.txt)
25+
* [BSA_P171_QID1638_TestPeptides_SequenceOnly_ProteinToPeptideMapping.txt](https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer/blob/master/bin/BSA_P171_QID1638_TestPeptides_SequenceOnly_ProteinToPeptideMapping.txt)
26+
27+
1428
## Console Version Syntax
1529

30+
There are two options for computing protein coverage and peptide to protein mapping from the command line:
31+
* ProteinCoverageSummarizerGUI.exe
32+
* PeptideToProteinMapper.exe
33+
34+
### Console Arguments for ProteinCoverageSummarizerGUI.exe
35+
1636
```
1737
ProteinCoverageSummarizerGUI.exe
1838
/I:PeptideInputFilePath /R:ProteinInputFilePath [/O:OutputDirectoryName]
@@ -21,7 +41,8 @@ ProteinCoverageSummarizerGUI.exe
2141

2242
The input file path can contain the wildcard character *. If a wildcard is
2343
present, the same protein input file path will be used for each of the peptide
24-
input files matched.
44+
input files matched. If the input file has one column of data, it is assumed to be Peptide Sequence.
45+
If two columns of data (separated by a tab character), it is assumed to be Protein Name then Peptide Sequence.
2546

2647
The output directory name is optional. If omitted, the output files will be created
2748
in the same directory as the input file. If included, a subdirectory is created with
@@ -30,17 +51,72 @@ the name OutputDirectoryName.
3051
The parameter file path is optional. If included, it should point to a valid XML
3152
parameter file.
3253

33-
Use /G to ignore I/L differences when finding peptides in proteins or computing coverage.
54+
Use `/G` to ignore I/L differences when finding peptides in proteins or computing coverage.
55+
56+
Use `/H` to suppress (hide) the protein sequence in the _coverage.txt file.
57+
58+
Use `/M` to enable the creation of a protein to peptide mapping file.
59+
60+
Use `/K` to skip protein coverage computation steps
61+
62+
Use `/Debug` to keep the console open to see additional debug messages
63+
64+
Use `/KeepDB` to keep the SQLite database after processing (by default it is deleted)
65+
66+
67+
### Console Arguments for PeptideToProteinMapper.exe
68+
69+
The PeptideToProteinMapper reads in a text file containing peptide sequences then
70+
searches the specified .fasta or text file containing protein names and sequences
71+
to find the proteins that contain each peptide. The program will also compute
72+
the sequence coverage percent for each protein (though this can be disable using /K).
73+
74+
```
75+
PeptideToProteinMapper.exe
76+
/I:PeptideInputFilePath /R:ProteinInputFilePath
77+
[/O:OutputDirectoryName] [/P:ParameterFilePath] [/F:FileFormatCode]
78+
[/N:InspectParameterFilePath] [/G] [/H] [/K] [/A]
79+
[/L[:LogFilePath]] [/LogDir:LogDirectoryPath] [/VerboseLog] [/Q]
80+
```
81+
82+
The input file path can contain the wildcard character *. If a wildcard is
83+
present, the same protein input file path will be used for each of the peptide
84+
input files matched.
85+
86+
The output directory name is optional. If omitted, the output files will be
87+
created in the same directory as the input file. If included, then a subdirectory
88+
is created with the name OutputDirectoryName.
89+
90+
The parameter file path is optional. If included, it should point to a valid XML
91+
parameter file.
92+
93+
Use `/F` to specify the peptide input file format code. Options are:
94+
* 0=Auto Determine: Treated as `/F:1` unless name ends in _inspect.txt, then `/F:3`
95+
* 1=Peptide sequence in the 1st column (subsequent columns are ignored)
96+
* 2=Protein name in 1st column and peptide sequence 2nd column
97+
* 3=Inspect search results file (peptide sequence in the 3rd column)
98+
* 4=MS-GF+ search results file (peptide sequence in the column titled 'Peptide'; optionally scan number in the column titled 'Scan')
99+
* 5=SEQUEST, X!Tandem, Inspect, or MS-GF+ PHRP data file
100+
101+
When processing an Inspect search results file, use `/N` to specify the Inspect
102+
parameter file used (required for determining the mod names embedded in the
103+
identified peptides).
104+
105+
Use `/G` to ignore I/L differences when finding peptides in proteins or computing coverage
106+
107+
Use `/H` to suppress (hide) the protein sequence in the _coverage.txt file
34108

35-
Use /H to suppress (hide) the protein sequence in the _coverage.txt file.
109+
Use `/K` to skip the protein coverage computation steps (enabling faster processing)
36110

37-
Use /M to enable the creation of a protein to peptide mapping file.
111+
Use `/A` to create a copy of the source file, but with a new column listing the
112+
mapped protein for each peptide. If a peptide maps to multiple proteins, then
113+
multiple lines will be listed
38114

39-
Use /K to skip protein coverage computation steps
115+
Use `/L` to create a log file, optionally specifying the file name
40116

41-
Use /Debug to keep the console open to see additional debug messages
117+
Use `/LogDir` to define the directory in which the log file should be created
42118

43-
Use /KeepDB to keep the SQLite database after processing (by default it is deleted)
119+
Use `/VerboseLog` to create a detailed log file
44120

45121
## Contacts
46122

RevisionHistory.txt

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,8 @@
11
Protein Coverage Summarizer Change Log
22

3+
Version 1.3.7026; March 28, 2019
4+
- Check for closed or broken SQLite connection
5+
36
Version 1.3.7020; March 22, 2019
47
- Replaced SharedVBNetRoutines.dll with PRISMWin-library
58

0 commit comments

Comments
 (0)