Sorry for the late reply. Here's the block of text I am searching.
--------- EFETCH RESULT(1..3): [ 1. Methods Mol Biol. 2014;1140:315-23. doi: 10.1007/978-1-4939-0354-2_23. Screening Ligands by X-ray crystallography. Davies DR(1). Author information: (1)Emerald Bio, 7869 NE Day Road W, Bainbridge Island, WA, 98110, USA, ddavies@embios.com. X-ray crystallography is an invaluable technique in structure-based drug discovery, including fragment-based drug discovery, because it is the only technique that can provide a complete three dimensional readout of the interaction between the small molecule and its macromolecular target. X-ray diffraction (XRD) techniques can be employed as the sole method for conducting a screen of a fragment library, or it can be employed as the final technique in a screening campaign to confirm putative "hit" compounds identified by a variety of biochemical and/or biophysical screening techniques. Both approaches require an efficient technique to prepare dozens to hundreds of crystals for data collection, and a reproducible way to deliver ligands to the crystal. Here, a general method for screening cocktails of fragments is described. In cases where X-ray crystallography is employed as a method to verify putative hits, the cocktails of fragments described below would simply be replaced with single fragment solutions. PMID: 24590727
I have a list of 79 blocks of text written as entries for the reference section of a paper, in what I think is APA style. I want to extract the PubMed ID for each of them. I found a way to get the abstract from PubMed, which contains the ID. The problem is that the result comes back with similar hits, so I need to make certain I have the correct ID; hence the title search. I have the titles in a hash keyed by the number they had in the file. The original plan was to cycle through the hash, search these abstracts for each title to find the correct abstract, and then extract the ID from it.
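The plan described above could be sketched roughly as follows. This is only an illustration under assumptions: the names `%title_for`, `normalize`, and `pmid_for_title` are hypothetical, and the records are assumed to have already been split into one string per hit. The title is matched literally with `\Q...\E` after collapsing whitespace, so line wrapping inside a record does not matter.

```perl
use strict;
use warnings;

# Hypothetical data: reference number => title, plus one fetched record.
my %title_for = ( 1 => 'Screening Ligands by X-ray crystallography' );
my $record = join "\n",
    '1. Methods Mol Biol. 2014;1140:315-23.',
    'Screening Ligands by X-ray',
    'crystallography. Davies DR(1).',
    'PMID: 24590727';

# Collapse every run of whitespace (including newlines) to one space.
sub normalize {
    my ($text) = @_;
    $text =~ s/\s+/ /g;
    return $text;
}

# Return the PMID of the first record that contains the title, or undef.
sub pmid_for_title {
    my ($title, @records) = @_;
    my $want = normalize($title);
    for my $rec (@records) {
        my $flat = normalize($rec);
        # \Q...\E makes the title a literal string, not a pattern.
        next unless $flat =~ /\Q$want\E/i;
        return $1 if $flat =~ /PMID:\s*(\d+)/;
    }
    return;
}

for my $num (sort { $a <=> $b } keys %title_for) {
    my $pmid = pmid_for_title($title_for{$num}, $record);
    print "$num: ", (defined $pmid ? $pmid : 'no match'), "\n";
}
```

In this sketch the title is split across two lines in the sample record on purpose, to show that whitespace normalization lets it match anyway.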

The search for "Screening Ligands by X-ray crystallography" doesn't work, though: no match. "Screening Ligands by" does match. I thought the issue might be the "-": anything before it works fine, and anything after it works too, but "X-ray" simply fails.
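A plain "-" is not a regex metacharacter outside a character class, so the hyphen itself should not break the match. One possible cause, which is only a guess here, is that the hyphen in either the title or the fetched text is a Unicode look-alike rather than ASCII "-". A minimal sketch for checking that follows; the helper names `show_codepoints` and `ascii_hyphens` are hypothetical, the sample string is made up, and the input is assumed to be decoded character data.

```perl
use strict;
use warnings;

# List the code point of every character, to expose look-alike characters.
sub show_codepoints {
    my ($text) = @_;
    return join ' ', map { sprintf('U+%04X', ord $_) } split //, $text;
}

# Replace common Unicode dashes (U+2010..U+2015, U+2212) with ASCII "-".
sub ascii_hyphens {
    my ($text) = @_;
    $text =~ s/[\x{2010}-\x{2015}\x{2212}]/-/g;
    return $text;
}

# Hypothetical sample: "X-ray" typed with U+2010 HYPHEN instead of "-".
my $sample = "X\x{2010}ray";

print show_codepoints($sample), "\n";
print ascii_hyphens($sample) eq 'X-ray'
    ? "matches after cleanup\n"
    : "still differs\n";
```

If the dump shows U+002D on both sides, the hyphen is not the culprit and the difference lies elsewhere, for example in whitespace or a line break between "X-ray" and its neighbours.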


In reply to Re^2: Regular Expression Hiccup by Mindsword
in thread Regular Expression Hiccup by Mindsword
