hanging indent with symbolic operators - Haskell

Welcome to the Functional Programming Zulip Chat Archive.

Torsten Schmits

2021-03-04 11:09:33

can anyone explain why this parses:

a :: IO Int
a = do a <- pure 5
       pure a
 >>= pure
       pure 5

is there a rule about operators that overrides normal layouting rules? it appears that the indent of that line must be larger than the parent layout (like in case of a nested do, it's the outer do's layout indent). it also only works for expression statements, not binders.

if someone can point me to documentation, that would be great

2021-03-04 11:38:22

https://www.haskell.org/onlinereport/haskell2010/haskellch10.html#x17-17800010.3 provides algorithmic description of layout resolution - if you look at lines mentioning <n> (line fold), in your case they'll add } before the operator, because indent is lower than one set by do, and pop the <n>, because it's bigger than outer context

2021-03-04 11:39:47

Oh, wait, I've now realized you mean line after that :sweat_smile:

Georgi Lyubenov // googleson78

2021-03-04 11:41:05

ghc parser confirmed that in fact the whole do is the arg to the bind

Georgi Lyubenov // googleson78

2021-03-04 11:41:17

I would have guessed this requires BlockArguments tbh

2021-03-04 11:41:41

Okay, but how's

pure
  pure 5

valid?

Torsten Schmits

2021-03-04 11:41:48

:grinning:

Georgi Lyubenov // googleson78

2021-03-04 11:41:52

hahaha

Torsten Schmits

2021-03-04 11:41:57

Applicative (r->)

Georgi Lyubenov // googleson78

2021-03-04 11:41:57

the first pure is const

Georgi Lyubenov // googleson78

2021-03-04 11:42:15

it's const pure 5
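
A minimal sketch of that reading, for anyone following along. It assumes the layout algorithm closes the do block before >>= and that the literal 5 handed to the outer pure defaults to Integer. The names a' and viaConst are made up here.

```haskell
-- The snippet from the top of the thread, unchanged.
a :: IO Int
a = do a <- pure 5
       pure a
 >>= pure
       pure 5

-- The same expression with the layout written out by hand.
-- Application binds tighter than >>=, so the right operand is (pure pure) 5.
a' :: IO Int
a' = do { x <- pure 5; pure x } >>= (pure pure 5)

-- pure for the function Applicative is const, so the first pure drops the 5
-- and hands back the second pure.
viaConst :: IO Int
viaConst = do { x <- pure 5; pure x } >>= const pure (5 :: Integer)
```

All three should run to 5, since the right-hand side of >>= collapses to plain pure.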

2021-03-04 11:42:31

Ah, I see now :joy:

Georgi Lyubenov // googleson78

2021-03-04 11:42:47

maybe we should assassinate that instance from the ghc codebase

Georgi Lyubenov // googleson78

2021-03-04 11:43:07

while nobody is watching

Torsten Schmits

2021-03-04 11:43:14

now how to parse this:

a :: IO Int
a =
  do
       pure 5
       >>= f

Georgi Lyubenov // googleson78

2021-03-04 11:43:22

whoever is using it can't be up to any good

Torsten Schmits

2021-03-04 11:43:32

(inspired by a file from HLS breaking my parser)

Torsten Schmits

2021-03-04 11:43:55

apparently it might be (do pure 5) >>= f

Torsten Schmits

2021-03-04 11:44:41

Georgi Lyubenov // googleson78 said:

maybe we should assassinate that instance from the ghc codebase

hacker in hoodie gif

Torsten Schmits

2021-03-04 11:46:03

indeed!

2021-03-04 11:47:17

I would expect GHC to parse it as do (pure 5 >>= f) because of NondecreasingIndentation

Torsten Schmits

2021-03-04 11:47:44

not in a layout

Torsten Schmits

2021-03-04 11:48:24

if you move the >>= one char to the right, it parses the way you wrote

2021-03-04 11:48:47

Oh, it's for {n}, not <n>, right

Georgi Lyubenov // googleson78

2021-03-04 11:51:30

https://giphy.com/gifs/theoffice-the-office-tv-frame-toby-vyTnNTrs3wqQ0UIvwE

2021-03-04 11:53:14

We need -Wimplicit-layout in GHC, so that we can safely forget about it and write braces everywhere :sweat_smile:

Torsten Schmits

2021-03-04 11:53:46

doesn't help if you want to parse other people's code!

Georgi Lyubenov // googleson78

2021-03-04 11:55:54

or be more strict about what is allowed :sweat:

Torsten Schmits

2021-03-04 11:56:08

that will never happen :cry:

Torsten Schmits

2021-03-04 11:56:31

two spaces indent enforced everywhere. newline after layout open

2021-03-04 11:58:04

Thing is though, we often write stuff that seems completely reasonable to humans but requires this flexibility, like

if _ then do
  _
else do
  _

Torsten Schmits

2021-03-04 11:59:13

looks like it matches my rules

Georgi Lyubenov // googleson78

2021-03-04 12:00:07

then opens a layout, but no newline after it?

Georgi Lyubenov // googleson78

2021-03-04 12:00:11

right?

Torsten Schmits

2021-03-04 12:00:22

then doesn't open a layout

2021-03-04 12:00:31

only do does for expressions

Torsten Schmits

2021-03-04 12:00:37

only let, of, where, do

2021-03-04 12:01:13

@Georgi Lyubenov // googleson78 what you see with newlines after tokens like = is the indentation rule for the surrounding layout

2021-03-04 12:03:39

L (<n> : ts) (m : ms) = ; : (L ts (m : ms))     if m = n
                      = } : (L (<n> : ts) ms)   if n < m
L (<n> : ts) ms       = L ts ms                 -- this line
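
A small sketch of these rules as a Haskell function, to make it easier to trace a token stream by hand. It is a simplification rather than the report's full algorithm: explicit braces and the parse-error(t) rule are left out, and the Tok type, layoutL and example are names invented for this sketch.

```haskell
-- Tokens as section 10.3 sees them: ordinary lexemes plus the two
-- annotations added before layout resolution.
data Tok
  = Lexeme String  -- any ordinary token
  | Indent Int     -- <n>: first token on a line, at column n
  | Open Int       -- {n}: first token after a layout keyword, at column n
  deriving (Eq, Show)

-- L from the report, without explicit braces and without parse-error(t).
-- The second argument is the stack of enclosing layout columns.
layoutL :: [Tok] -> [Int] -> [String]
layoutL (Indent n : ts) (m : ms)
  | m == n = ";" : layoutL ts (m : ms)
  | n < m  = "}" : layoutL (Indent n : ts) ms
layoutL (Indent _ : ts) ms = layoutL ts ms  -- deeper indent: plain continuation
layoutL (Open n : ts) (m : ms)
  | n > m = "{" : layoutL ts (n : m : ms)
layoutL (Open n : ts) []
  | n > 0 = "{" : layoutL ts [n]
layoutL (Open n : ts) ms = "{" : "}" : layoutL (Indent n : ts) ms
layoutL (Lexeme t : ts) ms = t : layoutL ts ms
layoutL [] ms = map (const "}") ms          -- close whatever is still open

-- Token stream for the do / pure 5 / >>= f snippet posted earlier.
example :: [Tok]
example =
  [ Open 1, Lexeme "a", Lexeme "="
  , Indent 3, Lexeme "do"
  , Open 8, Lexeme "pure", Lexeme "5"
  , Indent 8, Lexeme ">>=", Lexeme "f"
  ]
```

layoutL example [] yields { a = do { pure 5 ; >>= f } }. The <3> in front of do is dropped by the last <n> rule because 3 is deeper than the enclosing column 1, and the <8> in front of >>= becomes a semicolon, which only the parse-error(t) rule can repair.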

Torsten Schmits

2021-03-04 12:04:51

a :: IO Int
a =
  do
       pure 5
       >>= f

my question now is: what is the condition in the layout algorithm that detects that >>= closes the do instead of only starting a new statement? is it because the op is symbolic?

Torsten Schmits

2021-03-04 12:05:51

or are there more cases that cause this?

Torsten Schmits

2021-03-04 12:06:04

it can't be type-directed, right?

2021-03-04 12:06:06

L (t : ts) (m : ms) = } : (L (t : ts) ms)   if m ≠ 0 and parse-error(t)

parse-error(>>=) should be true, shouldn't it?

Torsten Schmits

2021-03-04 12:06:20

huh

2021-03-04 12:06:27

it expects expression, but gets some operator
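
Spelled out as code, the effect of that rule on the snippet would look roughly like this. The definition of f is a hypothetical stand-in so the example typechecks, and aBraces is what the algorithm is assumed to produce: a semicolon from the <n> rule, then a closing brace from parse-error(t).

```haskell
-- Hypothetical continuation, only here so the snippet compiles.
f :: Int -> IO Int
f n = pure (n + 1)

-- As posted: >>= starts in the same column as the first statement.
a :: IO Int
a =
  do
       pure 5
       >>= f

-- Hand-written equivalent: the do block ends before >>=,
-- so the whole block is the left operand of the bind.
aBraces :: IO Int
aBraces = do { pure 5; } >>= f
```

Both should run to 6. Because the result is the same either way, only the parse tree shows that the bind sits outside the block.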

Torsten Schmits

2021-03-04 12:06:28

that sucks

2021-03-04 12:06:47

Yeah - reason why lexer and parser have to be entangled in Haskell

Torsten Schmits

2021-03-04 12:07:10

guess I'll have to use the symbolic character as a condition for layout_end and hope for the best
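
A rough sketch of what such a check could look like. It is not taken from any real scanner: endsLayout and isAscSymbol are hypothetical names, and the character set is the ascSymbol list from the Haskell 2010 report.

```haskell
-- ASCII symbol characters that can start an operator.
isAscSymbol :: Char -> Bool
isAscSymbol c = c `elem` "!#$%&*+./<=>?@\\^|-~:"

-- Heuristic: a token sitting exactly at the layout's indent ends the layout,
-- instead of starting a new statement, when it looks like an operator.
endsLayout :: Int -> Int -> String -> Bool
endsLayout layoutIndent tokenIndent tok =
  tokenIndent == layoutIndent && startsWithSymbol tok
  where
    startsWithSymbol (c : _) = isAscSymbol c
    startsWithSymbol []      = False
```

This is only an approximation of parse-error(t). It misses backtick operators and keywords, and it would misread a statement that legitimately begins with a symbol, such as a negated expression.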

2021-03-04 12:08:42

One more case is then BTW

Torsten Schmits

2021-03-04 12:09:04

how so?

2021-03-04 12:09:32

You can do if do True then ...

Torsten Schmits

2021-03-04 12:10:37

indeed, that confuses my parser as well

2021-03-04 12:10:50

Or

if do True
      then True else False
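
For reference, both forms as complete definitions, next to a version with the braces written out. The names are made up for this sketch. It relies on a do block with a single expression needing no Monad, and on then being the token that triggers parse-error(t).

```haskell
-- Single line: then cannot continue the do block, so the block is closed.
oneLine :: Bool
oneLine = if do True then True else False

-- Two lines: then lines up with True, so a semicolon is inserted first
-- and the block is closed right after it.
twoLines :: Bool
twoLines = if do True
                 then True else False

-- What both are assumed to resolve to.
viaBraces :: Bool
viaBraces = if do { True } then True else False
```

All three evaluate to True.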

Torsten Schmits

2021-03-04 12:11:30

does that apply to other layouts or only do?

Torsten Schmits

2021-03-04 12:11:54

if a where a = 1 then

Torsten Schmits

2021-03-04 12:12:30

nope that's a parse error

Torsten Schmits

2021-03-04 12:12:56

right, where isn't allowed after just any expression

2021-03-04 12:13:19

Hmm, let is always guarded by something, and where could only appear inside of it or in top level block

Torsten Schmits

2021-03-04 12:13:35

I'm beginning to think that I should track do separately from the other layouts

2021-03-04 12:14:21

I mean, if you did the same trick for all of them, what would happen? It will end up with parse error anyway, won't it?

Torsten Schmits

2021-03-04 12:14:52

a = if case 5 of 5 -> True then 1 else 1

this is weird

2021-03-04 12:15:38

But if you want to be GHC-compliant, you do have to support NondecreasingIndentation, so do actually ends up being different after all

Torsten Schmits

2021-03-04 12:15:41

I mean, if you did the same trick for all of them, what would happen? It will end up with parse error anyway, won't it?

:thinking:

Torsten Schmits

2021-03-04 12:28:05

TheMatten said:

But if you want to be GHC-compliant, you do have to support NondecreasingIndentation, so do actually ends up being different after all

why does that make it different?

2021-03-04 12:28:44

You accept layout of do starting at the same level as the previous one

2021-03-04 12:29:07

do _
   if _ then _ else do
   _

2021-03-04 12:29:56

This doesn't apply to other layouts AFAIK

Torsten Schmits

2021-03-04 12:30:42

ugh come on!

Torsten Schmits

2021-03-04 12:31:23

welp, guess it's time to rewrite the layout handler

Torsten Schmits

2021-03-04 12:32:42

do you have a link for NondecreasingIndentation?

Torsten Schmits

2021-03-04 12:33:40

or is that encoded in the layout section of the syntax reference?

2021-03-04 12:34:05

https://downloads.haskell.org/ghc/latest/docs/html/users_guide/bugs.html?highlight=nondecreasingindentation#context-free-syntax

Torsten Schmits

2021-03-04 12:35:04

"bugs"

2021-03-04 12:35:04

In principle it's matter of switching > for >= in case of do layout
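
As a sketch, that switch could be a single predicate in a layout handler. The Keyword type and opensNested are hypothetical names, and treating only do this way follows the thread rather than a reading of GHC's lexer.

```haskell
-- Which keyword opened the block.
data Keyword = Do | Let | Of | Where
  deriving (Eq, Show)

-- May a block whose first token is at column n open inside an enclosing
-- context at column m? Haskell 2010 requires n > m; with the relaxation
-- switched on, do blocks also accept n == m.
opensNested :: Bool -> Keyword -> Int -> Int -> Bool
opensNested nondecreasing kw n m
  | nondecreasing && kw == Do = n >= m
  | otherwise                 = n > m
```

With the flag on, opensNested True Do 4 4 holds while opensNested True Of 4 4 does not, so a parser has to know the language mode before it can decide.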

Torsten Schmits

2021-03-04 12:35:35

"but not in Haskell2010 mode"?

Torsten Schmits

2021-03-04 12:36:57

does that mean it's impossible to tell whether it's legal to use the same indent?

Torsten Schmits

2021-03-04 12:37:15

(just by parsing the file)

2021-03-04 12:39:24

I guess that applies to all syntactic extensions

Torsten Schmits

2021-03-04 12:40:04

well it's simple to assume that a \case means we want LambdaCase

2021-03-04 12:40:56

But what about static keyword? :big_smile:

Torsten Schmits

2021-03-04 12:41:03

right

2021-03-04 12:41:29

or proc do

2021-03-04 12:41:37

or rec

2021-03-04 12:43:05

That's basically why I've decided to work with Haskell2010 in Hask - there's just too much stuff in GHC :sweat_smile:

Torsten Schmits

2021-03-04 12:43:47

well if you're lucky the parser will just decide the right variant of rec being keyword or identifier

Torsten Schmits

2021-03-04 12:44:43

in my case, the parse tree isn't used for typing afterwards, so it doesn't matter