Re: [WebDNA] Middle Question

This WebDNA talk-list message is from 2015.


It keeps the original formatting.
numero = 112019
interpreted = N
To Ken's point:

abc/def/ghi/jkl/mno/pqr/stu/vwx/

If you do this:

[middle startafter=def&endbefore=/]

You will get this:

/ghi/jkl/mno/pqr/stu/vwx


To my point, if I do
[text]myVar=abc/def/ghi/jkl/mno/pqr/stu/vwx/[/text]
[middle startafter=[url]f/[/url]&endbefore=[url]/j[/url]][myVar][/middle]<br />

you get: ghi

it seems that a "/" on its own, and some other characters are not valid as start and end points.


Regardless of adding MORE power to middle, I need it to work in a way that lets me use the current version logically, so that a start and end point work correctly. Right now, I'm totally stuck for a clean method of pulling known tags out of html dynamically .... it could be SO easy
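
A rough way to compare the two readings of endbefore discussed in this thread is to model them outside WebDNA. The Python sketch below is only an illustration: the function name `middle`, its `from_end` flag, and the choice to return the remainder of the text when the end token is missing are all assumptions, not WebDNA's implementation.

```python
def middle(text, start_after, end_before, from_end=False):
    """Toy model of [middle startafter=...&endbefore=...] (not WebDNA itself)."""
    start = text.find(start_after)
    if start == -1:
        return ""  # assumption: nothing is returned if the start token is absent
    start += len(start_after)
    if from_end:
        # Ken's reading: the end token is located from the end of the text
        end = text.rfind(end_before, start)
    else:
        # the reading of the docs: stop at the first end token after the start
        end = text.find(end_before, start)
    if end == -1:
        return text[start:]  # assumption: no end token means "rest of the text"
    return text[start:end]


my_var = "abc/def/ghi/jkl/mno/pqr/stu/vwx/"

print(repr(middle(my_var, "def", "/", from_end=True)))  # '/ghi/jkl/mno/pqr/stu/vwx'
print(repr(middle(my_var, "def", "/")))                 # ''
print(repr(middle(my_var, "f/", "/j")))                 # 'ghi'
print(repr(middle(my_var, "f/", "/j", from_end=True)))  # 'ghi'
```

In this model the two readings agree for f/ and /j because /j occurs only once in the string; they only diverge when the end token repeats, as "/" does here.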



On Thu, Jan 22, 2015 at 4:49 PM, Christophe Billiottet <christophe.billiottet@webdna.us> wrote:
We are working with [middle] to give it more power. What would you like to get in the next version? I took note of the "startbefore" and "endafter" options suggested a few days ago, but I am interested if someone has more ideas.

- Chris



On 23 Jan 2015, at 00:36, Steve Craig <steve@asylumweb.com> wrote:

I’ve always wondered if startafter and endafter were valid options but never got round to testing. [Middle] has never really done what I thought it should.

Cheers

========================================
Steve Craig - Asylum Interactive
========================================
Skype: s.craig - iChat: steve.craig
========================================



On 22 Jan 2015, at 21:29, David Bastedo <david@10plus1.com> wrote:

For EndBefore, the docs say: String of text characters to search for defining the end of the text to be returned. All following text (and the EndBefore text itself) will be ignored. --> this suggests that it should ignore everything after this, including the stop token itself.

so, the block I am trying to remove: <meta property="og:title" content="Toronto Star copyeditor edits memo announcing the elimination of copyeditor jobs" />

by using : property="og:title" as my starting point
and ideally, I want to use: " / as my end point.

According to the docs, I should then get:
content="Toronto Star copyeditor edits memo announcing the elimination of copyeditor jobs


regardless, it isn't working and it is returning the rest of the page after the place where it is supposed to stop. It is not stopping.
If I url the start and end, I get the same result - no stoppage.



[text]theOutPut=[tcpconnect host=boingboing.net][!]
[/!][tcpsend skipheader=T]GET /2009/11/07/toronto-star-copyedi.html HTTP/1.0[!]
[/!][UnURL]%0D%0A[/Unurl]Host: boingboing.net[UnURL]%0D%0A%0D%0A[/UnURL][UnURL]%0D%0A[/Unurl][!]
[/!][/tcpsend][/tcpconnect][/text]

[text]myProp2=[middle startafter=[url]property="og:image"[/url]&endbefore=[url]" /[/url]][theOutPut][/middle][/text]
myProp2: [myProp2]<br />


Theoretically, I should be able to stop middle on the first quote that it runs into, as opposed to " / and I'd take />, but this breaks middle:
[middle startafter=[url]property="og:image"[/url]&endbefore=[url]/>[/url]][theOutPut][/middle]

With this I get the same result:
[middle startafter=[url]property="og:image"[/url]&endbefore=[url]/[/url]][theOutPut][/middle]
as

[middle startafter=[url]property="og:image"[/url]&endbefore=[url]" /[/url]][theOutPut][/middle]

and
[middle startafter=[url]property="og:image"[/url]&endbefore=[url]"[/url]][theOutPut][/middle]


none of these will stop anything past the endpoint from displaying.... any ideas?

d.
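
One detail about using a bare quote as the end token: even under a strict first-occurrence reading, the first quote after property="og:image" is the one that opens the content attribute, so the result would be empty of the URL. The Python sketch below illustrates that with a hypothetical helper named `between`; it models plain substring search only and assumes the attribute order and single space shown in the sample tag, so it says nothing about what WebDNA 6.2 actually does.

```python
def between(text, start_after, end_before):
    """Return the text after start_after, up to the first end_before that follows it."""
    start = text.find(start_after)
    if start == -1:
        return ""  # assumption: empty result when the start token is missing
    start += len(start_after)
    end = text.find(end_before, start)
    # assumption: fall back to the rest of the text when no end token is found
    return text[start:] if end == -1 else text[start:end]


tag = '<meta property="og:image" content="http://craphound.com/images/torstarsubsnit.jpeg" />'

# Stopping on the first quote only captures the attribute name.
print(repr(between(tag, 'property="og:image"', '"')))
# ' content='

# Folding content=" into the start token leaves the closing quote as the first match.
print(between(tag, 'property="og:image" content="', '"'))
# http://craphound.com/images/torstarsubsnit.jpeg
```

If the real tag puts content before property, or uses different spacing, the longer start token in the second call would not match.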


On Thu, Jan 22, 2015 at 2:24 PM, Kenneth Grome <ken@webdnasolutions.com> wrote:
endbefore works from the end of the specified text, thus you
should expect this:

 content="Toronto Star copyeditor edits memo announcing the
elimination of copyeditor jobs" />
<meta property="og:description" content="" />
<meta property="og:url"
content="http://boingboing.net/2009/11/07/toronto-star-copyedi.html"
/>
<meta property="og:image"
content="http://craphound.com/images/torstarsubsnit.jpeg"

Regards,
Kenneth Grome
WebDNA Solutions
http://www.webdnasolutions.com
Web Database Systems and Linux Server Management
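
Ken's description (endbefore located from the end of the text) can be checked against the Open Graph sample from the original question with a small Python model. This is a sketch under assumptions: `middle` and its `from_end` flag are made-up names, the sample is retyped from the question, and nothing here is taken from WebDNA's source.

```python
def middle(text, start_after, end_before, from_end=False):
    """Toy model of startafter/endbefore; from_end=True follows Ken's description."""
    start = text.find(start_after)
    if start == -1:
        return ""  # assumption: empty result when the start token is missing
    start += len(start_after)
    end = text.rfind(end_before, start) if from_end else text.find(end_before, start)
    # assumption: return the rest of the text when the end token is not found
    return text[start:] if end == -1 else text[start:end]


sample = (
    '<meta property="og:type" content="article" />\n'
    '<meta property="og:title" content="Toronto Star copyeditor edits memo '
    'announcing the elimination of copyeditor jobs" />\n'
    '<meta property="og:description" content="" />\n'
    '<meta property="og:url" content="http://boingboing.net/2009/11/07/toronto-star-copyedi.html" />\n'
    '<meta property="og:image" content="http://craphound.com/images/torstarsubsnit.jpeg" />\n'
)

# First occurrence of the end token: only the title's content attribute.
first = middle(sample, 'property="og:title"', '" /')
# Last occurrence of the end token: runs on through the og:image URL.
last = middle(sample, 'property="og:title"', '" /', from_end=True)

print(first)
print("---")
print(last)
```

The first result is the single attribute David expected; the second spans every following meta tag up to the og:image URL, which is roughly the shape of the output Ken lists.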


On 01/22/2015 12:52 PM, David Bastedo wrote:
> Is there somebody who can shed some light on the proper use of
> [middle]. It's not behaving as I would expect.
>
> I am attempting to extract open graph meta data from html after
> grabbing it via [tcpconnect]
>
> so here is a sample of what I want to extract:
>
> <meta property="og:type" content="article" />
> <meta property="og:title" content="Toronto Star copyeditor edits memo announcing the elimination of copyeditor jobs" />
> <meta property="og:description" content="" />
> <meta property="og:url" content="http://boingboing.net/2009/11/07/toronto-star-copyedi.html" />
> <meta property="og:image" content="http://craphound.com/images/torstarsubsnit.jpeg" />
>
> given this code:
> [text]theOutPut=[tcpconnect host=boingboing.net][!]
> [/!][tcpsend skipheader=T]GET /2009/11/07/toronto-star-copyedi.html HTTP/1.0[!]
> [/!][UnURL]%0D%0A[/Unurl]Host: boingboing.net[UnURL]%0D%0A%0D%0A[/UnURL][UnURL]%0D%0A[/Unurl][!]
> [/!][/tcpsend][/tcpconnect][/text]
>
> [text]myProp=[middle startafter=property="og:title"&endbefore=" /][theOutPut][/middle][/text]
>
> myProp: [myProp]<br />
>
> I would expect myProp to return: content="Toronto Star copyeditor edits memo announcing the elimination of copyeditor jobs
>
> but, it is returning: myProp: content="Toronto Star copyeditor
> edits memo announcing the elimination of copyeditor jobs" /> and
> the rest of the page
>
> If I url everything, it doesn't work at all. Theoretically, middle
> is built for exactly this purpose and I should be able to run a
> series of Middles to extract specific og tags that I pre-describe.
>
> i'm using 6.2
>
> Thanks in advance!
>
> d.
>
>
>
> ---------------------------------------------------------
> This message is sent to you because you are subscribed to the mailing
> list <talk@webdna.us>. To unsubscribe, E-mail to: <talk-leave@webdna.us>
> archives: http://mail.webdna.us/list/talk@webdna.us
> Bug Reporting: support@webdna.us




--
David Bastedo
Ten Plus One Communications Inc.
http://www.10plus1.com
416.277.4499



--
David Bastedo
Ten Plus One Communications Inc.
http://www.10plus1.com
416.277.4499

Associated Messages, from the most recent to the oldest:

    
  1. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  2. Re: [WebDNA] Middle Question (christophe.billiottet@webdna.us 2015)
  3. Re: [WebDNA] Middle Question (David Bastedo 2015)
  4. Re: [WebDNA] Middle Question (Tom Duke 2015)
  5. Re: [WebDNA] Middle Question (christophe.billiottet@webdna.us 2015)
  6. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  7. Re: [WebDNA] Middle Question (David Bastedo 2015)
  8. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  9. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  10. Re: [WebDNA] Middle Question (David Bastedo 2015)
  11. Re: [WebDNA] Middle Question (Terry Wilson 2015)
  12. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  13. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  14. Re: [WebDNA] Middle Question (David Bastedo 2015)
  15. Re: [WebDNA] Middle Question (David Bastedo 2015)
  16. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  17. Re: [WebDNA] Middle Question (Terry Wilson 2015)
  18. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  19. Re: [WebDNA] Middle Question (Chris 2015)
  20. Re: [WebDNA] Middle Question (Chris 2015)
  21. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  22. Re: [WebDNA] Middle Question (Steve Craig 2015)
  23. Re: [WebDNA] Middle Question (David Bastedo 2015)
  24. Re: [WebDNA] Middle Question (Kenneth Grome 2015)
  25. [WebDNA] Middle Question (David Bastedo 2015)
David Bastedo
