Description of the CSEARCH Robot-Referee



What you have to do: What comes back from the CSEARCH-Robot-Referee: Input procedure: Tipps and Tricks: ********* Please keep in mind ********* If you have a problem with these facts, I offer a simple solution: Don't use this service !





Understanding the Report



The color coding scheme is similar to a „traffic light“
RED
Here is something wrong or at least there is a high probability for an error
YELLOW
A problem might be present
GREEN
Good, well assigned information




Structure Proposal:

The structure proposal as given by the author(s) is displayed – this is done for checking the input data.


Numbering Scheme:




Assigned chemical shift values:

The marked carbons have been fully assigned - the structure proposal is shown with all carbons highlighted in GREEN where the authors have assigned a chemical shift value; we use GREEN here, because this 1:1-correlation between carbon environments and chemical shift values is necessary to utilize the dataset later on for spectrum prediction. The fully assigned lines are summarized in a table below the structure. Below the table the structure is shown with the chemical shift values directly attached to the correspondig carbon atoms.


Exchangeable assigned chemical shift values:

The marked carbons have exchangeable assigned lines: Here the structure proposal is shown with all carbons highlighted in YELLOW where the authors have assigned lines, but these lines are grouped together into 'exchange groups' – therefore we use YELLOW here. The exchangeable assigned lines are summarized below the structure.


Exchangeable assigned chemical shift values summarized per group:

The marked carbons have exchangeable assigned lines: Here the structure proposal is shown with all carbons highlighted belonging to one 'exchange-group' in YELLOW. This is repeated for all occuring exchange-groups.


MIssing lines:

The marked carbons have no lines assigned: Here the chemical shift values are either missing or only given without any assignment, sometimes even without any multiplicity information. This should be avoided, because this makes a NMR-spectrum useless for storing in a database system – for this reason the corresponding carbons are marked with RED color. A list of unassigned chemical shift values is given below the structure together with a schematic spectrum.


Definition of stereocenters:

This display highlights all stereocenters avilable in the molecule (only on chiral carbons), a RED marker shows a center with a missing up(or down)-bond or an otherwise not properly defined chiral center.


Predicted versus experimental data:

Matching map of predicted versus experimental data: Here the structure is shown with highlighted carbon positions, a GREEN marker tells us a good coincidence between predicted and experimentally determined chemical shift value. A YELLOW marker is given when the predicted chemical shift deviates usually less than 10 ppm. A RED marker shows a l arge deviation (usually more than 10 ppm) between prediction and experimenent. A LARGE CONNECTED part of the structure marked with RED color is a severe hint for a wrong structural proposal. The above-mentioned deviation of 10ppm used for seperating YELLOW and RED markers may vary according to the elements present in your query-structure (in organometallic compounds it is increased) and also varies depending on the underlying parameters describing the quality of the spectrum prediction itself. A deviation of e.g. 8ppm between experimental and predicted value at the 5 shell-level has to be taken more seriously than a 10ppm deviation at the 3 shell-level. Simply believe that a YELLOW box should be checked anyway and a RED box is a very severe hint pointing to an assignment error and/or error in your structure proposal..


Increments:

Differences between predicted and experimental data in ppm: These differences are inserted into the structure display; for sake of clarity any increment lower than 1.0 ppm is ignored.


Bar graph: Visualization of the differences between experiment and prediction:

TEXT.


Visualization of increments:

The differences between predicted and experimental data are visualized as two spectra with connections between corresponding lines. This is an excellent way to detect interchanged signal assignments.



All experimental chemical shift values:

Experimental shift values as given by author(s): In this picture ALL lines are given, independent from any assignment. This picture simply visualizes the input of chemical shift values. The multiplicity information is coded into the color of the line.

Singulet - Dublet - Triplet - Quartet - Odd (S or T) - Even (D or Q) - Unknown

Experimental Chemical Shift Values using Symmetry:

TEXT.


Assigned spectrum:

Assigned spectrum as given by the author(s): Here only lines assigned by the author are given – the intention is definitely to convince some authors that the chemical community deserves only well-assigned spectra.


Best predicted Spectrum:

This image shows the 'best predicted spectrum' – in this case the multiplicities as derived from the assignment are used. The multiplicities are encoded into the color of the line.

Singulet - Dublet - Triplet - Quartet


Compatibility of multiplicity from structure and experiment:

Overall impression on compatibility of multiplicity from structure and experiment: The input data are checked for consistency. The multiplicities calculated from the structure are compared against the multiplicities supplied by the author, any inconsistency is highlighted in the table above, the overall impression with respect to this evaluation is summarized in the quality bar below.


Quality of the Spectrum Prediction:

Here the spectrum prediction itself is evaluated. Why? For ranking it is important to know to which extent the prediction was successful – in case of missing reference data in the CSEARCH-system behind, it is necessary to accept larger deviations for a better ranking. If this parameter is not taken into account, every structure which has uncommon functional groups with respect to the reference database, will be ranked lower. The consequence is a worse classification (e.g. 'Major Revision' instead of 'Minor Revision').


Best prediction:

Preferred Chemical Shift Values from both predictions: In this picture the structure is given with attached chemical shift values, which have selected as 'best choice' from both prediction techniques. The underlying color reflects the quality of the prediction.


Carbons with massive contribution from stereochemistry:

TEXT.


Visual comparison of NN-Prediction and HOSE-Prediction:

TEXT.


Analysis of Methods used:

Contribution of the Methods: The carbons in the structure are highlighted according to the source of the 'best predicted shift value'. A blue circle shows that the best predicted shift value is coming from the HOSE-code method, a violet circle tells us that the Network-value has been preferred. A red circle shows that both methods were unable to produce a prediction.


Proposal for improved signal assignment:

TEXT.


Similarity:

Similarity between predicted and experimental data based on positions: It is necessary to evaluate the resulting signal assignment in terms of 2 parameters, the first one is used here. Assume a situation, where you have an average deviation of 10ppm per carbon, which is extremely high, but this deviation is focused only on ONE, SINGLE POSITION within the structure. In such a case a typing error has a high probabilty (e.g. 151.1 instead of 51.1 ppm). When the average deviation is distributed over many carbons (in some cases 'connected carbons') then a high probability for an error with the structure proposal itself, is given.


Similarity:

Similarity between predicted and experimental data is X.Y ppm: It is necessary to evaluate the resulting signal assignment in terms of 2 parameters, the second one is used here. Assume a situation, where you have an average deviation of 10ppm per carbon, which is extremely high, but this deviation is focused only on ONE, SINGLE POSITION within the structure. In such a case a typing error has a high probabilty (e.g. 151.1 instead of 51.1 ppm). When the average deviation is distributed over many carbons (in some cases 'connected carbons') then a high probability for an error with the structure proposal itself, is given.


Symmetry Check:

Eventually Symmetry Error: Same environment - Different shiftvalue: This warning tells you, that you have different chemical shift values on identical functional groups. The underlying algorithm does not take into account stereochemistry, therefore you will get also warnings when the values are correct.


Symmetry Check:

Eventually Symmetry Error: Same shiftvalue – Different environment: This warning tells you, that you have identical chemical shift values on different functional groups – this might be a hint for a typo in your peaklist.


Symmetry Check:

Eventually Symmetry Error: Same environment - shiftvalue missing, but known: This warning tells you, that symmetrical atoms have assigned values and for the marked atom no value is given. This warning should not occur furthermore, because symmetrical carbon positions are automatically filled with the chemical shift values, whenever you enter the first value for this group.


The quality bar:

The quality bar shows in a graphics way the quality of the evaluation – the range is given from „Poor“ to „Good“. It should be stated, that this bar really covers the range from „Horrible“ to „Excellent“ - these two words reflect the situation with real-world NMR-data much better than the pair „Poor/Good“.


External links:

Search the Internet using 2D-/3D-Inchikeys: The Inchikey is a structure descriptor well-suited for text-based search-engines. Clicking the Descriptor launches a Google-search using either the 2D-inchikey (connectivity) or the 3D-inchikey (connectivity, stereochemistry, mobile hydrogens) retrieving all webpages which have been indexed by Google holding information on the identical structure.


Identical structure search:

All accessed CSEARCH-databases are searched for the query-structure – the structure comparison is based on the 2D-topology neglecting stereochemistry. In the case the identity is found, the structure and the spectrum together with the literature citation is shown. The literature citation is linked via the DOI (Digital Object Identifier) with the article at the publishers webpage. Accessing the literature depends on the license of your organization; abstracts are usually available for free.


Identical spectrum search:

The query spectrum is used to retrieve identical spectra from all accessed CSEARCH-databases – the spectrum comparison is only based on chemical shift values, the multiplicity information is ignored. You won't believe how many people "sell" the same spectrum twice without stating that this is a structure revision. Only entries having a different structure than the query-structure are shown.



Page written by: Wolfgang.Robien@c13nmr.at

Created on: 25-Feb-2010
Last update: 12-Mar-2021