首页 > 最新文献

Molecular Systems Design & Engineering最新文献

英文 中文
Molecular analysis and design using generative artificial intelligence via multi-agent modeling.
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-01-24 DOI: 10.1039/d4me00174e
Isabella Stewart, Markus J Buehler

We report the use of a multiagent generative artificial intelligence framework, the X-LoRA-Gemma large language model (LLM), to analyze, design and test molecular design. The X-LoRA-Gemma model, inspired by biological principles and featuring 7 billion parameters, dynamically reconfigures its structure through a dual-pass inference strategy to enhance its problem-solving abilities across diverse scientific domains. The model is used to first identify molecular engineering targets through a systematic human-AI and AI-AI self-driving multi-agent approach to elucidate key targets for molecular optimization to improve interactions between molecules. Next, a multi-agent generative design process is used that includes rational steps, reasoning and autonomous knowledge extraction. Target properties of the molecule are identified either using a principal component analysis (PCA) of key molecular properties or sampling from the distribution of known molecular properties. The model is then used to generate a large set of candidate molecules, which are analyzed via their molecular structure, charge distribution, and other features. We validate that as predicted, increased dipole moment and polarizability is indeed achieved in the designed molecules. We anticipate an increasing integration of these techniques into the molecular engineering workflow, ultimately enabling the development of innovative solutions to address a wide range of societal challenges. We conclude with a critical discussion of challenges and opportunities of the use of multi-agent generative AI for molecular engineering, analysis and design.

{"title":"Molecular analysis and design using generative artificial intelligence <i>via</i> multi-agent modeling.","authors":"Isabella Stewart, Markus J Buehler","doi":"10.1039/d4me00174e","DOIUrl":"10.1039/d4me00174e","url":null,"abstract":"<p><p>We report the use of a multiagent generative artificial intelligence framework, the X-LoRA-Gemma large language model (LLM), to analyze, design and test molecular design. The X-LoRA-Gemma model, inspired by biological principles and featuring 7 billion parameters, dynamically reconfigures its structure through a dual-pass inference strategy to enhance its problem-solving abilities across diverse scientific domains. The model is used to first identify molecular engineering targets through a systematic human-AI and AI-AI self-driving multi-agent approach to elucidate key targets for molecular optimization to improve interactions between molecules. Next, a multi-agent generative design process is used that includes rational steps, reasoning and autonomous knowledge extraction. Target properties of the molecule are identified either using a principal component analysis (PCA) of key molecular properties or sampling from the distribution of known molecular properties. The model is then used to generate a large set of candidate molecules, which are analyzed <i>via</i> their molecular structure, charge distribution, and other features. We validate that as predicted, increased dipole moment and polarizability is indeed achieved in the designed molecules. We anticipate an increasing integration of these techniques into the molecular engineering workflow, ultimately enabling the development of innovative solutions to address a wide range of societal challenges. We conclude with a critical discussion of challenges and opportunities of the use of multi-agent generative AI for molecular engineering, analysis and design.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" ","pages":""},"PeriodicalIF":3.2,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11868987/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143539515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Retraction: Heteroatoms chemical tailoring of aluminum nitrite nanotubes as biosensors for 5-hydroxyindole acetic acid (a biomarker for carcinoid tumors): insights from a computational study
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-01-10 DOI: 10.1039/D5ME90004B
Chioma B. Ubah, Martilda U. Akem, Innocent Benjamin, Henry O. Edet, Adedapo S. Adeyinka and Hitler Louis

Retraction of ‘Heteroatoms chemical tailoring of aluminum nitrite nanotubes as biosensors for 5-hydroxyindole acetic acid (a biomarker for carcinoid tumors): insights from a computational study’ by Chioma B. Ubah et al., Mol. Syst. Des. Eng., 2024, 9, 832–846, https://doi.org/10.1039/D4ME00019F.

{"title":"Retraction: Heteroatoms chemical tailoring of aluminum nitrite nanotubes as biosensors for 5-hydroxyindole acetic acid (a biomarker for carcinoid tumors): insights from a computational study","authors":"Chioma B. Ubah, Martilda U. Akem, Innocent Benjamin, Henry O. Edet, Adedapo S. Adeyinka and Hitler Louis","doi":"10.1039/D5ME90004B","DOIUrl":"https://doi.org/10.1039/D5ME90004B","url":null,"abstract":"<p >Retraction of ‘Heteroatoms chemical tailoring of aluminum nitrite nanotubes as biosensors for 5-hydroxyindole acetic acid (a biomarker for carcinoid tumors): insights from a computational study’ by Chioma B. Ubah <em>et al.</em>, <em>Mol. Syst. Des. Eng.</em>, 2024, <strong>9</strong>, 832–846, https://doi.org/10.1039/D4ME00019F.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 2","pages":" 167-167"},"PeriodicalIF":3.2,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/me/d5me90004b?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143107591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nanostructured liquid-crystalline ion conductors based on linear carbonate moieties: effects of oligooxyethylene and alkylene spacers on self-assembled properties and ionic conductivities†
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-01-06 DOI: 10.1039/D4ME00176A
Junya Uchida, Shingo Takegawa, Soshi Ito, Shunsuke Sato, Go Watanabe and Takashi Kato

We here report rodlike liquid-crystalline (LC) molecules consisting of bicyclohexyl and linear carbonate moieties connected through flexible spacers for the development of nanostructured ion-conductive materials. The molecular assemblies of the linear carbonate-based rodlike compounds mixed with a lithium salt provide 2D ion-conductive pathways in the smectic LC phases. The LC materials containing polar oligooxyethylene spacers coupled with linear carbonate moieties have been shown to function as efficient ion conductors, while those containing nonpolar alkylene spacers form thermally stable and ordered smectic LC structures. Molecular dynamics simulations provide insights into the conformation and packing of the molecules containing oligooxyethylene spacers in the LC phases. The combination of flexible oligooxyethylene chains and linear carbonates may lead to design of new LC electrolytes with highly mobile 2D nanochannels for applications in energy devices.

{"title":"Nanostructured liquid-crystalline ion conductors based on linear carbonate moieties: effects of oligooxyethylene and alkylene spacers on self-assembled properties and ionic conductivities†","authors":"Junya Uchida, Shingo Takegawa, Soshi Ito, Shunsuke Sato, Go Watanabe and Takashi Kato","doi":"10.1039/D4ME00176A","DOIUrl":"https://doi.org/10.1039/D4ME00176A","url":null,"abstract":"<p >We here report rodlike liquid-crystalline (LC) molecules consisting of bicyclohexyl and linear carbonate moieties connected through flexible spacers for the development of nanostructured ion-conductive materials. The molecular assemblies of the linear carbonate-based rodlike compounds mixed with a lithium salt provide 2D ion-conductive pathways in the smectic LC phases. The LC materials containing polar oligooxyethylene spacers coupled with linear carbonate moieties have been shown to function as efficient ion conductors, while those containing nonpolar alkylene spacers form thermally stable and ordered smectic LC structures. Molecular dynamics simulations provide insights into the conformation and packing of the molecules containing oligooxyethylene spacers in the LC phases. The combination of flexible oligooxyethylene chains and linear carbonates may lead to design of new LC electrolytes with highly mobile 2D nanochannels for applications in energy devices.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 3","pages":" 184-193"},"PeriodicalIF":3.2,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/me/d4me00176a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143532990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Process-based screening of porous materials for vacuum swing adsorption based on 1D classical density functional theory and PC-SAFT† 基于一维经典密度泛函理论和PC-SAFT的真空摇摆吸附多孔材料工艺筛选
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2025-01-01 DOI: 10.1039/D4ME00127C
Fabian Mayer, Benedikt Buhk, Johannes Schilling, Philipp Rehner, Joachim Gross and André Bardow

Adsorption-based processes are showing substantial potential for carbon capture. Due to the vast space of potential solid adsorbents and their influence on the process performance, the choice of the material is not trivial but requires systematic approaches. In particular, the material choice should be based on the performance of the resulting process. In this work, we present a method for the process-based screening of porous materials for pressure and vacuum swing adsorption. The method is based on an equilibrium process model that incorporates one-dimensional classical density functional theory (1D-DFT) and the PC-SAFT equation of state. Thereby, the presented method can efficiently screen databases of potential adsorbents and identify the best-performing materials as well as the corresponding optimized process conditions for a specific carbon capture application. We apply our method to a point-source carbon capture application at a cement plant. The results show that the process model is crucial to evaluating the performance of adsorbents instead of relying solely on material heuristics. Furthermore, we enhance our approach through multi-objective optimization and demonstrate for materials with high performance that our method is able to capture the trade-offs between two process objectives, such as specific work and purity. The presented method thus provides an efficient screening tool for adsorbents to maximize process performance.

以吸附为基础的工艺在碳捕获方面显示出巨大的潜力。由于潜在的固体吸附剂的广阔空间及其对工艺性能的影响,材料的选择不是微不足道的,而是需要系统的方法。特别是,材料的选择应根据所得到的工艺性能而定。在这项工作中,我们提出了一种基于工艺筛选多孔材料的压力和真空摆动吸附方法。该方法基于结合一维经典密度泛函理论(1D-DFT)和PC-SAFT状态方程的平衡过程模型。因此,该方法可以有效地筛选潜在吸附剂数据库,并为特定的碳捕获应用确定最佳性能的材料以及相应的优化工艺条件。我们将我们的方法应用于水泥厂的点源碳捕获应用。结果表明,过程模型是评价吸附剂性能的关键,而不是仅仅依靠材料启发式。此外,我们通过多目标优化增强了我们的方法,并证明了高性能材料,我们的方法能够捕获两个过程目标之间的权衡,例如比功和纯度。因此,提出的方法为吸附剂提供了一种有效的筛选工具,以最大限度地提高工艺性能。
{"title":"Process-based screening of porous materials for vacuum swing adsorption based on 1D classical density functional theory and PC-SAFT†","authors":"Fabian Mayer, Benedikt Buhk, Johannes Schilling, Philipp Rehner, Joachim Gross and André Bardow","doi":"10.1039/D4ME00127C","DOIUrl":"10.1039/D4ME00127C","url":null,"abstract":"<p >Adsorption-based processes are showing substantial potential for carbon capture. Due to the vast space of potential solid adsorbents and their influence on the process performance, the choice of the material is not trivial but requires systematic approaches. In particular, the material choice should be based on the performance of the resulting process. In this work, we present a method for the process-based screening of porous materials for pressure and vacuum swing adsorption. The method is based on an equilibrium process model that incorporates one-dimensional classical density functional theory (1D-DFT) and the PC-SAFT equation of state. Thereby, the presented method can efficiently screen databases of potential adsorbents and identify the best-performing materials as well as the corresponding optimized process conditions for a specific carbon capture application. We apply our method to a point-source carbon capture application at a cement plant. The results show that the process model is crucial to evaluating the performance of adsorbents instead of relying solely on material heuristics. Furthermore, we enhance our approach through multi-objective optimization and demonstrate for materials with high performance that our method is able to capture the trade-offs between two process objectives, such as specific work and purity. The presented method thus provides an efficient screening tool for adsorbents to maximize process performance.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 3","pages":" 219-227"},"PeriodicalIF":3.2,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11701972/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142941624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accelerating multicomponent phase-coexistence calculations with physics-informed neural networks†
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-12-24 DOI: 10.1039/D4ME00168K
Satyen Dhamankar, Shengli Jiang and Michael A. Webb

Phase separation in multicomponent mixtures is of significant interest in both fundamental research and technology. Although the thermodynamic principles governing phase equilibria are straightforward, practical determination of equilibrium phases and constituent compositions for multicomponent systems is often laborious and computationally intensive. Here, we present a machine-learning workflow that simplifies and accelerates phase-coexistence calculations. We specifically analyze capabilities of neural networks to predict the number, composition, and relative abundance of equilibrium phases of systems described by Flory–Huggins theory. We find that incorporating physics-informed material constraints into the neural network architecture enhances the prediction of equilibrium compositions compared to standard neural networks with minor errors along the boundaries of the stable region. However, introducing additional physics-informed losses does not lead to significant further improvement. These errors can be virtually eliminated by using machine-learning predictions as a warm-start for a subsequent optimization routine. This work provides a promising pathway to efficiently characterize multicomponent phase coexistence.

{"title":"Accelerating multicomponent phase-coexistence calculations with physics-informed neural networks†","authors":"Satyen Dhamankar, Shengli Jiang and Michael A. Webb","doi":"10.1039/D4ME00168K","DOIUrl":"https://doi.org/10.1039/D4ME00168K","url":null,"abstract":"<p >Phase separation in multicomponent mixtures is of significant interest in both fundamental research and technology. Although the thermodynamic principles governing phase equilibria are straightforward, practical determination of equilibrium phases and constituent compositions for multicomponent systems is often laborious and computationally intensive. Here, we present a machine-learning workflow that simplifies and accelerates phase-coexistence calculations. We specifically analyze capabilities of neural networks to predict the number, composition, and relative abundance of equilibrium phases of systems described by Flory–Huggins theory. We find that incorporating physics-informed material constraints into the neural network architecture enhances the prediction of equilibrium compositions compared to standard neural networks with minor errors along the boundaries of the stable region. However, introducing additional physics-informed losses does not lead to significant further improvement. These errors can be virtually eliminated by using machine-learning predictions as a warm-start for a subsequent optimization routine. This work provides a promising pathway to efficiently characterize multicomponent phase coexistence.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 2","pages":" 89-101"},"PeriodicalIF":3.2,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/me/d4me00168k?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143107592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding stable adsorption states in flexible soft porous coordination polymers through free energy profiles†
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-12-11 DOI: 10.1039/D4ME00154K
James E. Carpenter, Jean Galliano Vega Díaz, Johnathan Robinson and Yamil J. Colón

Soft porous coordination polymers (SPCPs) are flexible porous materials comprised of metal–organic polyhedrons (MOPs) connected by organic linkers, with potential in adsorption applications. We performed molecular simulations of various SPCPs that vary in the length and flexibility of the organic linkers to address how the flexibility can result in various configurations and affects adsorption performance. We examined free energy profiles as a function of volume of different SPCPs while varying methane loading, resulting in different stable configurations. We found significant differences in the volume of the stable configurations and their number for the various structures, with more flexible linkers having more stable configurations in free energy. We also characterized the textural properties and methane adsorption isotherms of the stable configurations for the SPCPs and analyzed density profiles of the adsorption in the various configurations. Altogether, our examination can be used to predict the relevant configurations of the SPCPs at a given loading and provides molecular-level understanding of how the flexibility of the organic linkers affects the structure of the system and adsorption performance.

{"title":"Understanding stable adsorption states in flexible soft porous coordination polymers through free energy profiles†","authors":"James E. Carpenter, Jean Galliano Vega Díaz, Johnathan Robinson and Yamil J. Colón","doi":"10.1039/D4ME00154K","DOIUrl":"https://doi.org/10.1039/D4ME00154K","url":null,"abstract":"<p >Soft porous coordination polymers (SPCPs) are flexible porous materials comprised of metal–organic polyhedrons (MOPs) connected by organic linkers, with potential in adsorption applications. We performed molecular simulations of various SPCPs that vary in the length and flexibility of the organic linkers to address how the flexibility can result in various configurations and affects adsorption performance. We examined free energy profiles as a function of volume of different SPCPs while varying methane loading, resulting in different stable configurations. We found significant differences in the volume of the stable configurations and their number for the various structures, with more flexible linkers having more stable configurations in free energy. We also characterized the textural properties and methane adsorption isotherms of the stable configurations for the SPCPs and analyzed density profiles of the adsorption in the various configurations. Altogether, our examination can be used to predict the relevant configurations of the SPCPs at a given loading and provides molecular-level understanding of how the flexibility of the organic linkers affects the structure of the system and adsorption performance.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 3","pages":" 194-204"},"PeriodicalIF":3.2,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/me/d4me00154k?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143532991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Is DFT enough? Towards accurate high-throughput computational screening of azobenzenes for molecular solar thermal applications† DFT足够了吗?用于分子太阳热应用的偶氮苯的精确高通量计算筛选
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-12-11 DOI: 10.1039/D4ME00183D
Flavia Aleotti, Lorenzo Soprani, Lucas F. Rodríguez-Almeida, Francesco Calcagno, Fabio Loprete, Ivan Rivalta, Silvia Orlandi, Elisabetta Canè, Marco Garavelli, Irene Conti and Luca Muccioli

An efficient screening of azobenzene (AB) derivatives for Molecular Solar Thermal (MOST) applications based on ground state properties (energy stored per molecule and Z isomer stability) could be performed with quasi-CASPT2 accuracy. In this work, we show how wavefunction and electron density based methods can be efficiently combined in a computational protocol that yields accurate potential energy profiles with a significant reduction in computational cost compared to that of a fully-CASPT2 characterization. Our results on prototypical electron donor/withdrawing AB derivatives clearly identify pull–pull substitution as the most promising, allowing to draw guidelines for the chemical design of promising azo-MOST candidates.

基于基态性质(每个分子储存的能量和Z异构体的稳定性),偶氮苯(AB)衍生物的分子太阳能热(MOST)应用的有效筛选可以以准caspt2精度进行。在这项工作中,我们展示了基于波函数和电子密度的方法如何有效地结合在一个计算协议中,与完全caspt2表征相比,该计算协议可以产生准确的势能分布,并显着降低计算成本。我们对原型电子供体/撤回AB衍生物的研究结果清楚地确定了拉-拉取代是最有希望的,从而为有希望的偶氮- most候选物的化学设计提供了指导。
{"title":"Is DFT enough? Towards accurate high-throughput computational screening of azobenzenes for molecular solar thermal applications†","authors":"Flavia Aleotti, Lorenzo Soprani, Lucas F. Rodríguez-Almeida, Francesco Calcagno, Fabio Loprete, Ivan Rivalta, Silvia Orlandi, Elisabetta Canè, Marco Garavelli, Irene Conti and Luca Muccioli","doi":"10.1039/D4ME00183D","DOIUrl":"https://doi.org/10.1039/D4ME00183D","url":null,"abstract":"<p >An efficient screening of azobenzene (AB) derivatives for Molecular Solar Thermal (MOST) applications based on ground state properties (energy stored per molecule and <em>Z</em> isomer stability) could be performed with quasi-CASPT2 accuracy. In this work, we show how wavefunction and electron density based methods can be efficiently combined in a computational protocol that yields accurate potential energy profiles with a significant reduction in computational cost compared to that of a fully-CASPT2 characterization. Our results on prototypical electron donor/withdrawing AB derivatives clearly identify pull–pull substitution as the most promising, allowing to draw guidelines for the chemical design of promising azo-MOST candidates.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 1","pages":" 13-18"},"PeriodicalIF":3.2,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/me/d4me00183d?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142912704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PepMNet: a hybrid deep learning model for predicting peptide properties using hierarchical graph representations†
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-12-11 DOI: 10.1039/D4ME00172A
Daniel Garzon Otero, Omid Akbari and Camille Bilodeau

Peptides are a powerful class of molecules that can be applied to a range of problems including biomaterials development and drug design. Currently, machine learning-based property prediction models for peptides primarily rely on amino acid sequence, resulting in two key limitations: first, they are not compatible with non-natural peptide features like modified sidechains or staples, and second, they use human-crafted features to describe the relationships between different amino acids, which reduces the model's flexibility and generalizability. To address these challenges, we have developed PepMNet, a deep learning model that integrates atom-level and amino acid-level information through a hierarchical graph approach. The model first learns from an atom-level graph and then generates amino acid representations based on the atomic information captured in the first stage. These amino acid representations are then combined using graph convolutions on an amino acid-level graph to produce a molecular-level representation, which is then passed to a fully connected neural network for property prediction. We evaluated this architecture by predicting two peptide properties: chromatographic retention time (RT) as a regression task and antimicrobial peptide (AMP) activity as a classification task. For the regression task, PepMNet achieved an average R2 of 0.980 across eight datasets, which spanned different dataset sizes and three liquid chromatography (LC) methods. For the classification task, we developed an ensemble of five models to reduce overfitting and ensure robust classification performance, achieving an area under the receiver operating curve (AUC-ROC) of 0.978 and an average precision of 0.981. Overall, our model illustrates the potential for hierarchical deep learning models to learn peptide properties without relying on human engineering amino acid features.

肽是一类功能强大的分子,可用于解决生物材料开发和药物设计等一系列问题。目前,基于机器学习的多肽性质预测模型主要依赖于氨基酸序列,这导致了两个关键的局限性:首先,这些模型与非天然多肽特征(如修饰侧链或主链)不兼容;其次,它们使用人为创建的特征来描述不同氨基酸之间的关系,这降低了模型的灵活性和通用性。为了应对这些挑战,我们开发了一种深度学习模型 PepMNet,它通过分层图的方法整合了原子层和氨基酸层的信息。该模型首先从原子级图中学习,然后根据第一阶段捕获的原子信息生成氨基酸表征。然后使用氨基酸级图上的图卷积将这些氨基酸表征组合起来,生成分子级表征,再将其传递给全连接神经网络进行属性预测。我们通过预测两种肽属性对这一架构进行了评估:色谱保留时间 (RT) 作为回归任务,抗菌肽 (AMP) 活性作为分类任务。在回归任务中,PepMNet 在不同数据集大小和三种液相色谱 (LC) 方法的八个数据集中取得了 0.980 的平均 R2。在分类任务中,我们开发了一个由五个模型组成的集合,以减少过拟合并确保稳健的分类性能,接收者操作曲线下面积 (AUC-ROC) 达到 0.978,平均精度达到 0.981。总之,我们的模型说明了分层深度学习模型在不依赖人类工程氨基酸特征的情况下学习肽特性的潜力。
{"title":"PepMNet: a hybrid deep learning model for predicting peptide properties using hierarchical graph representations†","authors":"Daniel Garzon Otero, Omid Akbari and Camille Bilodeau","doi":"10.1039/D4ME00172A","DOIUrl":"https://doi.org/10.1039/D4ME00172A","url":null,"abstract":"<p >Peptides are a powerful class of molecules that can be applied to a range of problems including biomaterials development and drug design. Currently, machine learning-based property prediction models for peptides primarily rely on amino acid sequence, resulting in two key limitations: first, they are not compatible with non-natural peptide features like modified sidechains or staples, and second, they use human-crafted features to describe the relationships between different amino acids, which reduces the model's flexibility and generalizability. To address these challenges, we have developed PepMNet, a deep learning model that integrates atom-level and amino acid-level information through a hierarchical graph approach. The model first learns from an atom-level graph and then generates amino acid representations based on the atomic information captured in the first stage. These amino acid representations are then combined using graph convolutions on an amino acid-level graph to produce a molecular-level representation, which is then passed to a fully connected neural network for property prediction. We evaluated this architecture by predicting two peptide properties: chromatographic retention time (RT) as a regression task and antimicrobial peptide (AMP) activity as a classification task. For the regression task, PepMNet achieved an average <em>R</em><small><sup>2</sup></small> of 0.980 across eight datasets, which spanned different dataset sizes and three liquid chromatography (LC) methods. For the classification task, we developed an ensemble of five models to reduce overfitting and ensure robust classification performance, achieving an area under the receiver operating curve (AUC-ROC) of 0.978 and an average precision of 0.981. Overall, our model illustrates the potential for hierarchical deep learning models to learn peptide properties without relying on human engineering amino acid features.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 3","pages":" 205-218"},"PeriodicalIF":3.2,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/me/d4me00172a?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143532992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the design of macromolecular-based inks for two-photon 3D laser printing†
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-12-10 DOI: 10.1039/D4ME00160E
Samantha O. Catt, Clara Vazquez-Martel and Eva Blasco

Two-photon 3D laser printing (2PLP) is one of the most versatile methods for additive manufacturing of micro- to nano-scale objects with arbitrary geometries and fine features. With advancing technological capability and accessibility, the demand for new and versatile inks is increasing, with a trend toward printing functional or responsive structures. One approach for ink design is the use of a macromolecular ink consisting of a ‘pre-polymer’ functionalized with photocrosslinkable groups to enable printability. However, so far the synthesis of pre-polymer inks for 2PLP often relies on an arbitrary choice rather than systematic design. Additionally, current structure–property relationship studies are limited to commercial or small molecule-based inks. Herein, three macromolecular inks with varied compositions, molecular weights, and glass transition temperatures are synthesized and formulated into inks for 2PLP. 3D microstructures are fabricated and characterized in-depth with scanning electron microscopy as well as infrared spectroscopy and nanoindentation to enable the determination of structure–processability–property relationships. Overall, it is clearly demonstrated that the macromolecular design plays a role in the printability and mechanical properties of the obtained materials.

{"title":"Investigating the design of macromolecular-based inks for two-photon 3D laser printing†","authors":"Samantha O. Catt, Clara Vazquez-Martel and Eva Blasco","doi":"10.1039/D4ME00160E","DOIUrl":"https://doi.org/10.1039/D4ME00160E","url":null,"abstract":"<p >Two-photon 3D laser printing (2PLP) is one of the most versatile methods for additive manufacturing of micro- to nano-scale objects with arbitrary geometries and fine features. With advancing technological capability and accessibility, the demand for new and versatile inks is increasing, with a trend toward printing functional or responsive structures. One approach for ink design is the use of a macromolecular ink consisting of a ‘pre-polymer’ functionalized with photocrosslinkable groups to enable printability. However, so far the synthesis of pre-polymer inks for 2PLP often relies on an arbitrary choice rather than systematic design. Additionally, current structure–property relationship studies are limited to commercial or small molecule-based inks. Herein, three macromolecular inks with varied compositions, molecular weights, and glass transition temperatures are synthesized and formulated into inks for 2PLP. 3D microstructures are fabricated and characterized in-depth with scanning electron microscopy as well as infrared spectroscopy and nanoindentation to enable the determination of structure–processability–property relationships. Overall, it is clearly demonstrated that the macromolecular design plays a role in the printability and mechanical properties of the obtained materials.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 3","pages":" 176-183"},"PeriodicalIF":3.2,"publicationDate":"2024-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/me/d4me00160e?page=search","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143532989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extrapolative machine learning models for copolymers
IF 3.2 3区 工程技术 Q2 CHEMISTRY, PHYSICAL Pub Date : 2024-11-27 DOI: 10.1039/D4ME00123K
Israrul H. Hashmi, Himanshu, Rahul Karmakar and Tarak K. Patra

Machine learning models have been progressively used for predicting materials' properties. These models can be built using pre-existing data and are useful for rapidly screening the physicochemical space of a material, which is astronomically large. However, ML models are inherently interpolative, and their efficacy for searching candidates outside a material's known range of properties is unresolved. Moreover, the performance of an ML model is intricately connected to its learning strategy and the volume of training data. Here, we determine the relationship between the extrapolation ability of an ML model, the size and range of its training dataset, and its learning approach. We focus on a canonical problem of predicting the properties of a copolymer as a function of the sequence of its monomers. Tree search algorithms, which learn the similarities between polymer structures, are found to be inefficient for extrapolation. Conversely, the extrapolation capability of neural networks and XGBoost models, which attempt to learn the underlying functional correlation between the structure and properties of polymers, shows strong correlations with the volume and range of training data. These findings have important implications on ML-based new material development.

{"title":"Extrapolative machine learning models for copolymers","authors":"Israrul H. Hashmi, Himanshu, Rahul Karmakar and Tarak K. Patra","doi":"10.1039/D4ME00123K","DOIUrl":"https://doi.org/10.1039/D4ME00123K","url":null,"abstract":"<p >Machine learning models have been progressively used for predicting materials' properties. These models can be built using pre-existing data and are useful for rapidly screening the physicochemical space of a material, which is astronomically large. However, ML models are inherently interpolative, and their efficacy for searching candidates outside a material's known range of properties is unresolved. Moreover, the performance of an ML model is intricately connected to its learning strategy and the volume of training data. Here, we determine the relationship between the extrapolation ability of an ML model, the size and range of its training dataset, and its learning approach. We focus on a canonical problem of predicting the properties of a copolymer as a function of the sequence of its monomers. Tree search algorithms, which learn the similarities between polymer structures, are found to be inefficient for extrapolation. Conversely, the extrapolation capability of neural networks and XGBoost models, which attempt to learn the underlying functional correlation between the structure and properties of polymers, shows strong correlations with the volume and range of training data. These findings have important implications on ML-based new material development.</p>","PeriodicalId":91,"journal":{"name":"Molecular Systems Design & Engineering","volume":" 2","pages":" 158-166"},"PeriodicalIF":3.2,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143107590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Molecular Systems Design & Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1