Using rib with my own parser

actually, now that I think about it, I think i don't have markdown w/ pollen, just some subset of things I want are built in to the markup (e.g. double newlines -> paragraphs

Joel McCracken

2019-12-26 03:50:01

So, as far as I can tell, I will have to modify rib itself, i can't just implement my own IsMarkup thing

Sridhar Ratnakumar

2019-12-26 03:50:51

Yes, but the only modification you need to do is to add a constructor to the Markup type. This is something we can obviate by improving the API.

cool!

Would it be worth it for me to try to figure it out?

Sridhar Ratnakumar

2019-12-26 03:51:29

Assuming you are using the master branch (and not one on Hackage)

Sridhar Ratnakumar

2019-12-26 03:51:47

Sure; I'm here to help

Sridhar Ratnakumar

2019-12-26 03:52:08

This type basically https://github.com/srid/rib/blob/master/src/Rib/Markup.hs#L26-L28

srid/rib

Haskell library for writing your own static site generator - srid/rib

Sridhar Ratnakumar

2019-12-26 03:52:30

You'd add a constructor like Markup_MyXmlThingy :: Markup MyXmlDoc

Sridhar Ratnakumar

2019-12-26 03:52:45

(in addition to writing an instance for IsMarkup MyXmlDoc)

Sridhar Ratnakumar

2019-12-26 03:54:25

See https://github.com/srid/rib/blob/master/src/Rib/Markup/MMark.hs for an example. That adds mmark support.

srid/rib

Haskell library for writing your own static site generator - srid/rib

Sridhar Ratnakumar

2019-12-26 03:55:21

Tomorrow I'll figure out if there is a way to obviate it. But meanwhile you may just modify rib ...

Sridhar Ratnakumar

2019-12-26 03:56:18

The only reason the Markup doc type exists is to build the patterns dict for passing to buildHtmlMulti.

Joel McCracken

2019-12-26 03:57:05

well what I meant was figuring out how to get rid of it

Sridhar Ratnakumar

2019-12-26 03:57:08

        Rib.buildHtmlMulti patterns $
          renderPage . Page_Doc
    -- File patterns to build, using the associated markup parser
    patterns =
      Map.fromList
        [ ([relfile|*.md|], Some Rib.Markup_MMark)
        ]

That tells the function, "yea, build *.md files using this markup parser"

Joel McCracken

2019-12-26 03:57:13

(the need to modify RIB itself)

sure

:thumbs_up:

id def prefer to adoid having my own fork for long

Joel McCracken

2019-12-26 03:58:58

question @Sridhar Ratnakumar does the desire I am describing ring true for you too? (that is, writing in an extensible markup language)

Sridhar Ratnakumar

2019-12-26 03:59:05

Perhaps buildHtmlMulti can take the IsMarkup constraint, and then you call it multiple times, one for each markup type:

mdPages <- buildHtmlMulti @MMark [relfile|*.md] ...
xmlPages <- buildHtmlMulti @MyXmlType [relfile|*.xml] ...
let allPages = mdPages <> xmlPages

Joel McCracken

2019-12-26 03:59:08

its weird to me that it seems like most people do not care about it

Joel McCracken

2019-12-26 03:59:32

im not sure if i'm just a weirdo, or if its like people never think about it, or what

Joel McCracken

2019-12-26 03:59:45

yea thats kinda what I was thinking

Sridhar Ratnakumar

2019-12-26 04:00:08

I've thought about something like xml. Or yaml. Or even dhall ... as representation of structured data (if not text).

Sridhar Ratnakumar

2019-12-26 04:00:28

For example, links.dhall - which gets processed and rendered into a page of links.

Joel McCracken

2019-12-26 04:00:31

really the reason I like xml over those options is that i think its a bit more flexible

Joel McCracken

2019-12-26 04:00:42

like, xml already supports e.g. CDATA

Joel McCracken

2019-12-26 04:00:48

If i want to embed something funky, its there

Joel McCracken

2019-12-26 04:00:51

but yeah i'm with you

Sridhar Ratnakumar

2019-12-26 04:01:05

I just wouldn't want to use xml for formatting text content.

same

But for structured data, it is fine.

Sridhar Ratnakumar

2019-12-26 04:01:36

One of these days I want to add dhall support.

Joel McCracken

2019-12-26 04:02:04

i mean, something like adding *bold* to the xml processing would go a really long away

Sridhar Ratnakumar

2019-12-26 04:02:43

during development of rib you may find nix-shell --run ghcid to be handy.

Joel McCracken

2019-12-26 04:02:44

text separated by a blank line to mean new paragraph, etc

cool

And to test your library changes, you can clone https://github.com/srid/rib-sample and run nix-shell --arg rib /path/to/your/rib --run 'ghcid -T ":main serve -p 9876"' (spins up http://localhost:9876). though you'd have to re-run the command after changing rib library.

srid/rib-sample

Sample site for the Rib static site generator. Contribute to srid/rib-sample development by creating an account on GitHub.

Joel McCracken

2019-12-26 04:04:57

would it be appropriate to add as ection to the readme

for "developing rib"

ya, good idea

i almost always do that

Joel McCracken

2019-12-26 04:05:25

like for my own stuff

Joel McCracken

2019-12-26 04:05:31

because i freaking forget which version of a command works

Joel McCracken

2019-12-26 04:05:44

for some reason i have a very hard time figuring out the right way to make ghcid do what I want it to do

Sridhar Ratnakumar

2019-12-26 04:06:00

i almost never have to configure ghcid.

Sridhar Ratnakumar

2019-12-26 04:06:08

defaults work fine for me

except for the -T argument, which is kind of needed if you also want to run the program in addition to compiling it

Sridhar Ratnakumar

2019-12-26 04:06:58

ghcid -T "blah" will basically evaluate "blah" in the ghci repl it launches

Sridhar Ratnakumar

2019-12-26 04:07:06

(right after every successful compile)

Joel McCracken

2019-12-26 04:07:13

liek i had to do this to make the tests run

Joel McCracken

2019-12-26 04:07:14

ghcid -c 'stack ghci joelmccracken-hs:joelmccracken-hs-test' --test 'Main.main'

Joel McCracken

2019-12-26 04:07:39

in this repo https://gitlab.com/JoelMcCracken/joelmccracken-hs

GitLab.com

Joel McCracken

2019-12-26 04:08:01

(this is where I was spiking out the xml processing for my xml version to validate i have a path forward)

Sridhar Ratnakumar

2019-12-26 04:08:31

you could just put those CLI in scripts/myscript

Sridhar Ratnakumar

2019-12-26 04:09:07

btw, development stuff may instead be added to https://github.com/srid/rib/blob/master/CONTRIBUTING.md

srid/rib

Haskell library for writing your own static site generator - srid/rib

Sridhar Ratnakumar

2019-12-26 04:09:22

feel free to open a PR, otherwise i'll add it tomorrow

i probably wont start tonight, about to go to bed

Joel McCracken

2019-12-26 04:16:02

and i am going to be afk for a few days

Joel McCracken

2019-12-26 04:16:11

but i might be hacking on it

family time

oh yea, merry christmas! :D

Joel McCracken

2019-12-26 04:18:10

ha ty, i dont celebrate it but we are going to get together as family

Sridhar Ratnakumar

2019-12-29 01:45:05

Looks like Data.Some and Data.Proxy is coming in handy to achieve this (eliminating need to modify rib)

Sridhar Ratnakumar

2019-12-29 02:02:48

I'm wondering if the 'meta' from Document meta should be discarded, for simplicity.

Sridhar Ratnakumar

2019-12-29 02:05:16

@Joel McCracken If you are curious, in particular this commit: https://github.com/srid/rib/pull/65/commits/4f42517abb6d244e51013644cb09a571341c77e7

Use Some & Proxy for markup customization by srid · Pull Request #65 · srid/rib

Fixes #62 todo Simplify API, lest user has to use Some too much? Test with rib-sample in a branch

Sridhar Ratnakumar

2019-12-29 04:13:28

Actually there are some problems with this. I'll address them.

Sridhar Ratnakumar

2019-12-29 17:55:39

Looks to be like in the end IsMarkup type class will have only one method left, to parse the file into some type. That's greatly simplifying.

Sridhar Ratnakumar

2019-12-29 22:34:26

w00t! got dhall version working. both markdown and dhall at same time, without having to change rib

Joel McCracken

2019-12-29 23:09:07

so its interesting, I really don't know if a "metas" section is necessary

Sridhar Ratnakumar

2019-12-29 23:09:20

meta is gone in the latest PR

Sridhar Ratnakumar

2019-12-29 23:09:34

https://github.com/srid/rib/pull/65

Refactor API, so users can define their own markup types by srid · Pull Request #65 · srid/rib

Fixes #62 Still a WIP Simplify API, lest user has to use Some too much? Test with rib-sample in a branch

Joel McCracken

2019-12-29 23:09:36

it is necessary if 1) you need some kind of meta support and 2) your markup doesnt support some kind of metas

nice

instead I exposed metadata functions from MMark.hs and Pandoc.hs that the user can explicitly call if they need to extract meta

excellent

yeah, that sounds good

Joel McCracken

2019-12-29 23:10:48

but how do you handle like

Sridhar Ratnakumar

2019-12-29 23:10:57

I'm wondering if I can do away with the type class itself, and instead expect just a function Text -> IO a :-)

Joel McCracken

2019-12-29 23:11:21

possibly? hard to say per se

Sridhar Ratnakumar

2019-12-29 23:11:35

I'm not sure yet; still exploring in code ...

Joel McCracken

2019-12-29 23:11:43

if it were me, given your stated goal, i think what I would do is make handy, discrete functions

Joel McCracken

2019-12-29 23:11:52

that can be combined arbitrarily

Joel McCracken

2019-12-29 23:11:58

not saying you aren't doing that

Sridhar Ratnakumar

2019-12-29 23:12:31

The document type now looks like this:

data Document repr
  = Document
      { -- | Path to the document; relative to the source directory.
        _document_path :: Path Rel File,
        -- | Parsed representation of the document.
        _document_val :: repr
      }
  deriving (Generic, Functor)

Basically a tuple of filepath and parsed structure. That's the minimum rib will need to pass over to the user.

Joel McCracken

2019-12-29 23:12:35

just like i feel like I am not sure what probelm that typeclass solves

if that makes sense

ahh nice

And type class's responsibility has been reduced to ... basically creating this Document value. That's all it essentially does.

Sridhar Ratnakumar

2019-12-29 23:13:35

Well , it's a learning process for me - rib being my first haskell library :-D so I'm not surprised to begin from a place of overengineering.

yea

haha

no worires man! its better than my first attempt would be, for sure

Sridhar Ratnakumar

2019-12-29 23:14:10

anywya, in your case, it would be Document MyXmlType

yea

replacing type class with something like:

type MarkupParser a = forall m b. MonadIO m => Path b File -> m (Either Text a)

Sridhar Ratnakumar

2019-12-29 23:22:12

wow it actually worked haha

Sridhar Ratnakumar

2019-12-29 23:27:03

okay, so the shake function will looke like this:

buildHtmlMulti patterns myMarkupParser myPageBuilder

patterns, input function, output function. that's it

Sridhar Ratnakumar

2019-12-29 23:38:07

It's gone now: https://github.com/srid/rib/pull/65/commits/89f4843676dbf0f10a8e773534761785a814a74f

Refactor API, so users can define their own markup types by srid · Pull Request #65 · srid/rib

Fixes #62 Still a WIP Simplify API, lest user has to use Some too much? Test with rib-sample in a branch

Joel McCracken

2019-12-30 00:13:06

do you think its stableenough to develop against? i fetched your branch

Sridhar Ratnakumar

2019-12-30 00:58:53

I think it is nearly done. What do you think of the API? Looking for some final tweaks before I merge.

Sridhar Ratnakumar

2019-12-30 00:59:42

(One last thing to do before I merge the PR is to update rib-sample)

Sridhar Ratnakumar

2019-12-30 01:08:36

I don't like "Rib.Markup". Dhall is not a markup. So I'm gonna rename it to something more general. Since there is now Rib.Source (renamed from 'Document', as 'source' reflects its nature more precisely), I'll try Rib.Parser

Sridhar Ratnakumar

2019-12-30 01:40:43

Okay, branch is stable. Going to test for bugs. Then merge. Here's the user-facing upgrade path you can expect: https://github.com/srid/rib-sample/pull/4/files

Upgrade to Rib's API refactor (removal of type class, etc.) by srid · Pull Request #4 · srid/rib-sample

For srid/rib#65

Sridhar Ratnakumar

2019-12-30 01:41:40

In that PR, notice the use of M.parseIO. That's where you would inject your Xml parser function.

Sridhar Ratnakumar

2019-12-30 01:54:15

@Joel McCracken Everything merged to master!

Sridhar Ratnakumar

2019-12-31 17:03:30

@Joel McCracken When you get to play with it, I'd be interested in hearing your feedback regarding the SourceReader type. Whether it does what you want re: your xml parsing. Personally I just found that I needed to switch from IO to shake's Action monad, when working with dhall parsing, so I am proposing this change: https://github.com/srid/rib/pull/69

Make SourceReader a Shake Action by srid · Pull Request #69 · srid/rib

This will allow us to do Shake-y things in our readers. For example, parsing our Dhall files may require need'ing its dependent .dhall files, doing which ensures that when those files change ri...

Joel McCracken

2019-12-31 17:05:04

I'm going to work with this today, at least I am planning on it. I'll let you know when I do

Sridhar Ratnakumar

2020-01-01 01:20:19

I merged this PR. Shouldn't be huge change in the user code. Just replace IO a with Action a in your reader function. And prepend all IO functions with liftIO. Prefer readFile' (from import Development.Shake) when reading some file, as it will track the dependency automatically.

Sridhar Ratnakumar

2020-01-01 01:46:22

Here's what adding Dhall parser support to a rib-based static website currently looks like :-) https://github.com/srid/website/pull/6/files

Add links list, using Dhall by srid · Pull Request #6 · srid/website

cf. srid/rib#64

Sridhar Ratnakumar

2020-01-01 01:46:40

I think I'm gonna add that parse function into Rib.Parser.Dhall

Using rib with my own parser - Rib