<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>PharmaScreen Archives - Pharmacelera | Pushing the limits of computational chemistry</title>
	<atom:link href="https://pharmacelera.com/blog/tag/pharmascreen/feed/" rel="self" type="application/rss+xml" />
	<link></link>
	<description></description>
	<lastBuildDate>Thu, 01 Aug 2019 07:36:49 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://pharmacelera.com/wp-content/uploads/2022/04/cropped-Linkedin-avatar-32x32.png</url>
	<title>PharmaScreen Archives - Pharmacelera | Pushing the limits of computational chemistry</title>
	<link></link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Clustering methods for large molecular library screening</title>
		<link>https://pharmacelera.com/blog/science/clustering-methods-big-library-screening/</link>
		
		<dc:creator><![CDATA[Fernando Martín]]></dc:creator>
		<pubDate>Thu, 01 Aug 2019 07:36:47 +0000</pubDate>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[clustering]]></category>
		<category><![CDATA[hydrophobic descriptors]]></category>
		<category><![CDATA[non-hierarchical]]></category>
		<category><![CDATA[Pharmacelera]]></category>
		<category><![CDATA[PharmaScreen]]></category>
		<guid isPermaLink="false">https://new.pharmacelera.com/?p=4440</guid>

					<description><![CDATA[<p>Today, the amount of data generated in many fields such as engineering, social sciences or medicine is suffering a tremendous scale-up. Extraction [&#8230;]</p>
<p>The post <a href="https://pharmacelera.com/blog/science/clustering-methods-big-library-screening/">Clustering methods for large molecular library screening</a> appeared first on <a href="https://pharmacelera.com">Pharmacelera | Pushing the limits of computational chemistry</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>Today, the amount of data generated in many fields such as engineering, social sciences or medicine is suffering a tremendous scale-up. Extraction of relevant information is hence becoming increasingly challenging, and methods of data analysis such as clustering are now crucial. Clustering consists on the identification of homogeneous subgroups among a set of heterogeneous items. In the context of computer-aided drug discovery, where the chemical space is estimated to be 10<sup>63</sup> compounds, we can use it to select promising subgroups inside a large chemical library, getting rid of the bulk of the dataset with either no medical interest or redundancy <em>a priori</em>.</p>



<h2 class="wp-block-heading">Non-hierarchical methods</h2>



<p>Clustering algorithms vary largely on their efficiency, so some methods are more practical than others for their application on libraries of different size (Downs &amp; Barnard, 2002). For large datasets, the most suitable are non-hierarchical algorithms, that are based in a single partition of the data. From them, probably the most widely used is K-means (Forgy, 1965), that efficiently generates a user-defined number of clusters that are iteratively updated until an optimal classification is found. Compared with the family of nearest-neighbor algorithms such as the one developed by Darko Butina (Butina, 1999), K-means produces groups with a homogeneous size, although this sometimes comes at a cost of a higher internal heterogeneity (Walters, 2019). A faster implementation of K-means, called Mini Batch K-means (Sculley, 2010) uses just a small portion of the data to update the clusters in each iteration.</p>



<div class="wp-block-image"><figure class="aligncenter is-resized"><img fetchpriority="high" decoding="async" src="https://pharmacelera.com/wp-content/uploads/2019/08/clusters-1024x530.png" alt="" class="wp-image-4441" width="379" height="196" srcset="https://pharmacelera.com/wp-content/uploads/2019/08/clusters-1024x530.png 1024w, https://pharmacelera.com/wp-content/uploads/2019/08/clusters-300x155.png 300w, https://pharmacelera.com/wp-content/uploads/2019/08/clusters-768x397.png 768w, https://pharmacelera.com/wp-content/uploads/2019/08/clusters.png 1367w" sizes="(max-width: 379px) 100vw, 379px" /><figcaption>Hierarchical vs non-hierarchical clustering</figcaption></figure></div>



<h2 class="wp-block-heading">Our approach</h2>



<p>To assess the utility of Mini Batch K-means, we generated a dataset by combining all sets of the Directory of Useful Decoys with the library from Specs, considered as decoys. In total, the dataset contained more than 300K structures from which just 2,2K were active compounds. Applying our proposed multistep protocol that combines clustering with <a href="https://pharmacelera.com/science/">3D hydrophobic fields</a> overlays, we were able retrieve a significant number of hits, screening less than a 5% of all compounds, and more importantly with high chemical diversity.</p>



<figure class="wp-block-image"><img decoding="async" width="1024" height="867" src="https://pharmacelera.com/wp-content/uploads/2019/08/clustering_methods_big_library_screening-1024x867.png" alt="" class="wp-image-4442" srcset="https://pharmacelera.com/wp-content/uploads/2019/08/clustering_methods_big_library_screening-1024x867.png 1024w, https://pharmacelera.com/wp-content/uploads/2019/08/clustering_methods_big_library_screening-300x254.png 300w, https://pharmacelera.com/wp-content/uploads/2019/08/clustering_methods_big_library_screening-768x650.png 768w, https://pharmacelera.com/wp-content/uploads/2019/08/clustering_methods_big_library_screening.png 1924w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption>Combination of Mini Batch K-means with hydrophobic descriptors.</figcaption></figure>



<p>Although applying clustering methods might hide interesting structures compared to the integral screening of all compounds in a dataset, it also enables exploring a much larger chemical space that, otherwise, would be intractable. Future steps should be focused on finding the optimal number of clusters to generate, as well as increasing the quality of the chosen representative molecules.</p>



<p><strong>Which is your experience with clustering large chemical libraries? How do you think we should cope with these large datasets?</strong></p>



<p></p>



<ul class="wp-block-list"><li> Butina, D. (1999). <a href="https://pubs.acs.org/doi/10.1021/ci9803381">Unsupervised data base clustering based on daylight’s fingerprint and Tanimoto similarity: A fast and automated way to cluster small and large data sets</a>. <em>Journal of Chemical Information and Computer Sciences</em>, <em>39</em>(4), 747–750. https://doi.org/10.1021/ci9803381 </li><li> Downs, G. M., &amp; Barnard, J. M. (2002). <a href="https://onlinelibrary.wiley.com/doi/10.1002/0471433519.ch1">Clustering Methods and Their Uses in Computational Chemistry</a>. In <em>Reviews in Computational Chemistry, Volume 18</em> (pp. 1–40). Hoboken, New Jersey, USA: John Wiley &amp; Sons, Inc. https://doi.org/10.1002/0471433519.ch1 </li><li> Forgy, E. (1965). <a href="https://www.scirp.org/(S(czeh2tfqyw2orz553k1w0r45))/reference/ReferencesPapers.aspx?ReferenceID=2317605">Cluster analysis of multivariate data : efficiency versus interpretability of classifications</a>. <em>Biometrics</em>, <em>21</em>, 768–769. </li><li> Sculley, D. (2010). <a href="https://dl.acm.org/citation.cfm?id=1772862#">Web-Scale K-Means Clustering.</a> <em>Proceedings of the 19th International Conference on World Wide Web</em>, 1177–1178. </li><li> Walters, W. P. (2019). K-means Clustering. Retrieved from <a href="http://practicalcheminformatics.blogspot.com/2019/01/k-means-clustering.html">http://practicalcheminformatics.blogspot.com/2019/01/k-means-clustering.html</a> </li></ul>



<p><br></p>
<p>The post <a href="https://pharmacelera.com/blog/science/clustering-methods-big-library-screening/">Clustering methods for large molecular library screening</a> appeared first on <a href="https://pharmacelera.com">Pharmacelera | Pushing the limits of computational chemistry</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>New user interface for PharmScreen</title>
		<link>https://pharmacelera.com/blog/upgrades/new-user-interface-for-pharmscreen/</link>
		
		<dc:creator><![CDATA[Enric Gibert]]></dc:creator>
		<pubDate>Wed, 09 Aug 2017 14:45:47 +0000</pubDate>
				<category><![CDATA[Upgrades]]></category>
		<category><![CDATA[computational chemistry]]></category>
		<category><![CDATA[drug discovery]]></category>
		<category><![CDATA[PharmaScreen]]></category>
		<category><![CDATA[Virtual screening]]></category>
		<guid isPermaLink="false">https://www.pharmacelera.com/?p=3172</guid>

					<description><![CDATA[<p>Check out our new user interface for PharmScreen! Perform virtual screening campaigns with just a few clicks and take advantage of Pharmacelera&#8217;s [&#8230;]</p>
<p>The post <a href="https://pharmacelera.com/blog/upgrades/new-user-interface-for-pharmscreen/">New user interface for PharmScreen</a> appeared first on <a href="https://pharmacelera.com">Pharmacelera | Pushing the limits of computational chemistry</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Check out our new user interface for <a href="https://pharmacelera.com/pharmscreen/">PharmScreen</a>!</p>
<p>Perform virtual screening campaigns with just a few clicks and take advantage of Pharmacelera&#8217;s unique 3D hydrophobic fields!</p>
<p><a href="https://pharmacelera.com/pharmscreen/">PharmScreen</a> will find you leads with higher chemical diversity, increasing your chances of finding original scaffolds.</p>
<p>The post <a href="https://pharmacelera.com/blog/upgrades/new-user-interface-for-pharmscreen/">New user interface for PharmScreen</a> appeared first on <a href="https://pharmacelera.com">Pharmacelera | Pushing the limits of computational chemistry</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>PharmScreen field-based alignment outperforms traditional solutions</title>
		<link>https://pharmacelera.com/blog/publications/pharmscreen-field-based-alignment-outperforms-traditional-solutions/</link>
		
		<dc:creator><![CDATA[Enric Gibert]]></dc:creator>
		<pubDate>Sat, 05 Aug 2017 15:54:38 +0000</pubDate>
				<category><![CDATA[Publications]]></category>
		<category><![CDATA[drug discovery]]></category>
		<category><![CDATA[Pharmacelera]]></category>
		<category><![CDATA[PharmaScreen]]></category>
		<category><![CDATA[Virtual screening]]></category>
		<guid isPermaLink="false">https://www.pharmacelera.com/?p=3185</guid>

					<description><![CDATA[<p>PharmScreen field-based alignment outperforms traditional solutions by taking into account relevant ligand characteristics related with ligand-receptor interaction.PharmScreen superior alignment allows finding better [&#8230;]</p>
<p>The post <a href="https://pharmacelera.com/blog/publications/pharmscreen-field-based-alignment-outperforms-traditional-solutions/">PharmScreen field-based alignment outperforms traditional solutions</a> appeared first on <a href="https://pharmacelera.com">Pharmacelera | Pushing the limits of computational chemistry</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p><a href="https://pharmacelera.com/pharmscreen/">PharmScreen</a> field-based alignment<strong> outperforms traditional solutions</strong> by taking into account <strong>relevant ligand characteristics</strong> related with ligand-receptor interaction.<a href="https://pharmacelera.com/pharmscreen/">PharmScreen</a> superior alignment allows <strong>finding better leads </strong>in a drug discovery project.</p>
<p>In this figure, we can see how two cdk2 inhibitors are aligned by <a href="https://pharmacelera.com/pharmscreen/">PharmScreen</a> in comparison with traditional shape-based tools.</p>
<p><img decoding="async" class="size-medium aligncenter" src="https://image-store.slidesharecdn.com/d9a7002c-5af4-432e-ab4e-e4908808137a-original.png" width="1207" height="1027" /></p>
<p>The hydrophobic fields of the two ligands are shown as a lipophilic potential (a). PharmScreen alignments (c) emphasize the optimal hydrophilic/hydrophobic overlap while the alignment based on traditional descriptors (b) misses completely the necessary balance between hydrophobic and hydrophilic properties.</p>
<p>The post <a href="https://pharmacelera.com/blog/publications/pharmscreen-field-based-alignment-outperforms-traditional-solutions/">PharmScreen field-based alignment outperforms traditional solutions</a> appeared first on <a href="https://pharmacelera.com">Pharmacelera | Pushing the limits of computational chemistry</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
