<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>comparison Archives - Pharmacelera | Pushing the limits of computational chemistry</title>
	<atom:link href="https://pharmacelera.com/blog/tag/comparison/feed/" rel="self" type="application/rss+xml" />
	<link></link>
	<description></description>
	<lastBuildDate>Thu, 28 Oct 2021 08:12:03 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://pharmacelera.com/wp-content/uploads/2022/04/cropped-Linkedin-avatar-32x32.png</url>
	<title>comparison Archives - Pharmacelera | Pushing the limits of computational chemistry</title>
	<link></link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Measuring Virtual Screening Accuracy</title>
		<link>https://pharmacelera.com/blog/science/measuring-virtual-screening-accuracy/</link>
		
		<dc:creator><![CDATA[Fernando Martín]]></dc:creator>
		<pubDate>Wed, 11 Dec 2019 07:55:13 +0000</pubDate>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[BEDROCS]]></category>
		<category><![CDATA[comparison]]></category>
		<category><![CDATA[metrics]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[ROCE]]></category>
		<category><![CDATA[ROCS]]></category>
		<category><![CDATA[Virtual screening]]></category>
		<guid isPermaLink="false">https://new.pharmacelera.com/?p=5693</guid>

					<description><![CDATA[<p>By Javier Vazquez &#8211; Dec. 11, 2019 Virtual Screening (VS) is a core in-silico technology in medicinal and computational chemistry. Several tools [&#8230;]</p>
<p>The post <a href="https://pharmacelera.com/blog/science/measuring-virtual-screening-accuracy/">Measuring Virtual Screening Accuracy</a> appeared first on <a href="https://pharmacelera.com">Pharmacelera | Pushing the limits of computational chemistry</a>.</p>
]]></description>
										<content:encoded><![CDATA[		<div data-elementor-type="wp-post" data-elementor-id="5693" class="elementor elementor-5693" data-elementor-post-type="post">
						<section class="elementor-section elementor-top-section elementor-element elementor-element-fdd6797 elementor-section-boxed elementor-section-height-default elementor-section-height-default" data-id="fdd6797" data-element_type="section" data-e-type="section">
						<div class="elementor-container elementor-column-gap-default">
					<div class="elementor-column elementor-col-50 elementor-top-column elementor-element elementor-element-6410f01" data-id="6410f01" data-element_type="column" data-e-type="column">
			<div class="elementor-widget-wrap elementor-element-populated">
						<div class="elementor-element elementor-element-a18d09f elementor-widget elementor-widget-image" data-id="a18d09f" data-element_type="widget" data-e-type="widget" data-widget_type="image.default">
				<div class="elementor-widget-container">
															<img fetchpriority="high" decoding="async" width="697" height="715" src="https://pharmacelera.com/wp-content/uploads/2021/06/JV.jpg" class="attachment-large size-large wp-image-8184" alt="" srcset="https://pharmacelera.com/wp-content/uploads/2021/06/JV.jpg 697w, https://pharmacelera.com/wp-content/uploads/2021/06/JV-292x300.jpg 292w, https://pharmacelera.com/wp-content/uploads/2021/06/JV-230x236.jpg 230w, https://pharmacelera.com/wp-content/uploads/2021/06/JV-350x359.jpg 350w, https://pharmacelera.com/wp-content/uploads/2021/06/JV-480x492.jpg 480w" sizes="(max-width: 697px) 100vw, 697px" />															</div>
				</div>
					</div>
		</div>
				<div class="elementor-column elementor-col-50 elementor-top-column elementor-element elementor-element-34c3130" data-id="34c3130" data-element_type="column" data-e-type="column">
			<div class="elementor-widget-wrap elementor-element-populated">
						<div class="elementor-element elementor-element-d75cbb0 elementor-widget elementor-widget-text-editor" data-id="d75cbb0" data-element_type="widget" data-e-type="widget" data-widget_type="text-editor.default">
				<div class="elementor-widget-container">
									<p>By Javier Vazquez &#8211; Dec. 11, 2019</p>								</div>
				</div>
					</div>
		</div>
					</div>
		</section>
				<section class="elementor-section elementor-top-section elementor-element elementor-element-79c8e78a elementor-section-boxed elementor-section-height-default elementor-section-height-default" data-id="79c8e78a" data-element_type="section" data-e-type="section">
						<div class="elementor-container elementor-column-gap-default">
					<div class="elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-505fb828" data-id="505fb828" data-element_type="column" data-e-type="column">
			<div class="elementor-widget-wrap elementor-element-populated">
						<div class="elementor-element elementor-element-25949b21 elementor-widget elementor-widget-text-editor" data-id="25949b21" data-element_type="widget" data-e-type="widget" data-widget_type="text-editor.default">
				<div class="elementor-widget-container">
									
<p><p align="justify">Virtual Screening (VS) is a core in-silico technology in medicinal and computational chemistry. Several tools have been developed to exploit protein structures or collections of compounds to provide a quick and economical method for the discovery of novel active compounds. Hence, the comparative evaluation of VS algorithms becomes a fundamental exercise to assess the applicability of drug discovery tools. Here, we explain some metrics used to evaluate the accuracy of VS tools in retrospective studies, using known data sets of active and inactive compounds. The most common ones are <b>the area under the receiver operating characteristic (ROC) curve (AUC), the BEDROC score, the enrichment factor (EF)</b>, and <b>ROC enrichment (ROCe)</b>.</p></p>



<h3 class="wp-block-heading">How do we measure accuracy?</h3>



<p><p align="justify"><b>ROC curves</b> are one of the most used representations to compare VS algorithms performance. This kind of plot represents the relation between the fraction of true positives (active compounds) out of the total positives or sensitivity, versus the fraction of false positives out of the total negatives (inactive compounds) or specificity, as shown in Figure 1A. The closer the line to the upper left corner, the better, as it means that active molecules rank in the first positions of the ranking provided by a VS campaign.
</p></p>



<p><p align="justify">The most intuitive and easy way to compare ROC curves is the <b>AUC metric</b>, representing the area value under the ROC curve. AUC values range from 0 to 1, where 1 indicates a perfect classification of the screened compounds (i.e. all active compounds ranked before the inactive compounds). On the other hand, values lower than or equal to 0.5 are considered a bad prediction and are associated with a random classification.</p></p>



<p><p align="justify">The argument against using AUC values resides in the difficulty to characterize the initial part of a ranked list, particularly when we compare algorithms with high AUC score. . For example, an identical AUC value for two different VS methods (A and B) does not necessarily mean that both methods perform equally. As illustrated in Figure 1B, although methods A and B have the same AUC values, method A finds more active compounds in the initial part of the ranking, whereas method B retrieves more active compounds in the last part of the ranking.
</p></p>



<div class="wp-block-image"><figure class="aligncenter"><img decoding="async" width="584" height="306" src="https://new.pharmacelera.com/wp-content/uploads/2019/12/roc2.png" alt="roc curve example and roc curve comparison between two approaches to measure virtual screening performance" class="wp-image-5696" srcset="https://pharmacelera.com/wp-content/uploads/2019/12/roc2.png 584w, https://pharmacelera.com/wp-content/uploads/2019/12/roc2-300x157.png 300w" sizes="(max-width: 584px) 100vw, 584px" /></figure></div>



<p><p align="justify">
<i><b>Figure 1.</b> (A), Example of ROC curve. Hits: known active compound. Decoys: assumed non-active compounds. The shaded section (blue) is the area under the receiver operating characteristic (ROC) curve (AUC). (B), ROC curves for two different VS methods. Although VS method A (redline line) addresses the ‘‘early recognition’’ problem better than VS method B (blue line), their AUC are equal.</i>
</p></p>



<h3 class="wp-block-heading">The early recognition problem</h3>



<p><p align="justify">Early recognition of active molecules is of paramount importance in real-world screening applications, where researchers only test top-ranked molecules in biological assays due to their high costs. Hence, it is easy to argue that in the example above, method A performs better than method B. To evaluate this early enrichment behavior, several metrics have been proposed:
</p></p>



<ul class="wp-block-list"><li align="justify"><b>The Boltzmann-Enhanced Discrimination of ROC (BEDROC)</b> is a metric<sup>1</sup> that assigns more weight to early ranked molecules than late ranked molecules. The active compounds are weighted depending on their position in the ranking using an exponential function, ranging from 1.0 for the top ranked compound to close to zero for the lowest ranked compound. The exponential factor determines how much the BEDROC parameter focuses on the top of the list.The drawbacks of this metric are its dependency of the ratio of active/inactive compounds and the dependency of an extrinsic variable (adjustable exponential factor). That means that the value depends on the method and a particular experiment. Hence, sets with different active/inactive compound ratios cannot be compared directly using such a metric.
</li></ul>



<ul class="wp-block-list"><li align="justify"><b>Enrichment Factor (EF)</b> measures the fraction of active compounds found in a specific percentage, solving the problem of comparing the results for datasets with different active / inactive compound ratios. EF is a quite standard metric due to its intuitive interpretation related to the purpose of the VS itself (the ability to select a subset of molecules with a promising chance of finding a hit compound). However, although the EF metric is independent of adjustable parameters, it is still inﬂuenced by the number of active compounds in the dataset.  EF becomes smaller if fewer inactive molecules are initially present.</li><li><b>ROC enrichment (ROCe)</b> is the fraction of active compounds divided by the fraction of false positive compounds at a specific percentage of decoys retrieved. It represents the ability of the test to discriminate between two populations: active and inactive compounds. This approach solves the active/inactive compound ratio dependency present in the previous metrics. This metric only gives information about a defined percentage. Jain and Nicholls<sup>2</sup> suggest the 0.5, 1.0, 2, and 5 percentages to report the “early enrichment” values.</li></ul>



<h3 class="wp-block-heading">Exploring the chemical diversity</h3>



<p><p align="justify">As discussed, several metrics solve the early recognition problem. But, none of them distinguish different scenarios regarding chemical diversity. In general, a VS tool that ranks a given number of active compounds of different chemical families in the first positions performs better than a tool that ranks the same number of active compounds but all belonging to the same or fewer chemical families. <a href="https://new.pharmacelera.com/science/clustering-methods-big-library-screening/"><b>Clustering is a commonly used technique to classify compounds into different chemical families</b></a> based on scaffold similarity. To account for chemical diversity, the ROC curve metrics have been adjusted as follows<sup>3</sup></p></p>



<ul class="wp-block-list"><li align="justify"><b>Average-weighted ROC curve (awROC)</b>. The above scheme can be also embedded into the ROC enrichment. Thus, the value of the true positive hit is weighted depending on the cluster to which it belongs to and on the number of molecules in the cluster.</li><li align="justify"><b>Average-weighted Area  Under  the  Curve  (awAUC)</b>,  can  be  interpreted  as  the probability that  an  active  compound  with  a new  scaffold  is  ranked  before  an  inactive compound. Using an arithmetic weighting, each structure has a weight that is inversely proportional to the size of the cluster it belongs to. Therefore, the weight of all structures taken from one cluster is equal. Integrating this scheme into the basic AUC leads to an arithmetic weighted version (awAUC).</li></ul>


<p align="justify">The main drawback of the metrics that account for chemical diversity is that they are very sensitive to the methodology that is used to group molecules into chemical families.
</p>


<h3 class="wp-block-heading">Conclusions</h3>



<p><p align="justify">As discussed, the characterization of VS performance is of paramount importance to assess the applicability of these technologies in drug discovery. Despite the common aim of benchmarking studies, there is still no consensus on what is the best metric/s to analyze and compare results. Establishing standards in the field of cheminformatics, particularly in VS, is necessary to enhance the quality of publications and allow a reliable assessment of methods and progress.
</p></p>



<p><p align="justify"><strong>What is your opinion in this regard? What metric/s are you using to assess virtual screening tools? Choose your preferred metric and leave your comments in the survey!
</p></p>



<div style="height:100px" aria-hidden="true" class="wp-block-spacer"></div>



<div class="wp-block-button aligncenter"><a class="wp-block-button__link has-very-dark-gray-color has-very-light-gray-background-color has-text-color has-background no-border-radius" href="https://forms.gle/egYukSunuG72iFRK8">Go to survey</a></div>



<div style="height:100px" aria-hidden="true" class="wp-block-spacer"></div>



<p>REFERENCES</p>



<p>(1) &nbsp;Truchon, J.-F.; Bayly, C. I. Evaluating Virtual Screening Methods:  Good and Bad Metrics for the “Early Recognition” Problem. <em>J. Chem. Inf. Model.</em> <strong>2007</strong>, <em>47</em> (2), 488–508.</p>



<p>(2) &nbsp;Jain, AN; Nicholls, A. Recommendations for Evaluation of Computational Methods. <strong>2008</strong>, <em>22</em>, 133–139.</p>



<p>(3) Clark, R. D.; Webster-Clark, D. J. Managing Bias in ROC Curves. <em>J. Comput. Aided. Mol. Des.</em> <strong>2008</strong>, <em>22</em> (3–4), 141–146.</p>
								</div>
				</div>
					</div>
		</div>
					</div>
		</section>
				</div>
		<p>The post <a href="https://pharmacelera.com/blog/science/measuring-virtual-screening-accuracy/">Measuring Virtual Screening Accuracy</a> appeared first on <a href="https://pharmacelera.com">Pharmacelera | Pushing the limits of computational chemistry</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
