Skip to content
This repository has been archived by the owner on Jun 13, 2020. It is now read-only.

String library #675

Open
kiljacken opened this issue Feb 6, 2016 · 22 comments
Open

String library #675

kiljacken opened this issue Feb 6, 2016 · 22 comments
Labels

Comments

@kiljacken
Copy link
Member

kiljacken commented Feb 6, 2016

From @felixangell on January 18, 2016 15:30

String concatenation, destroying strings, substring, etc... also iterating through strings, length, blah blah blah

Copied from original issue: ark-lang/stdlib#3

@felixangell
Copy link
Member

What happened to the stdlib repo?

@kiljacken
Copy link
Member Author

Lol, if you were hanging around irc, you would've know that me an @MovingtoMars agreed that it would be less of a hassle to keep the stdlib in the main repo, as to not have to deal with subtrees/-modules.

@felixangell
Copy link
Member

@kiljacken Yeah I'm cool with that, been a little bit busy with college currently so I'm not as active 🥚

@andrewrk
Copy link

What is the data layout of the string type that Ark users should use?

Ark's standard library makes a call to fopen from libc which takes a char *. This requires a pointer with a null terminated byte. Is this compatible with Ark's string type? If not, what is the plan for conversion?

@kiljacken
Copy link
Member Author

Arks standard string is a non-null terminated []u8, which is just length and data, so we have to do conversion and adding a null terminator. I believe that @0xbadb002 missed this when he implemented the file stuff.

@andrewrk
Copy link

where do you allocate the memory to do the conversion?

@kiljacken
Copy link
Member Author

We don't, currently, but I'd like functions for automatically allocating on
the heap and doing the conversion, and a function for doing the conversion
in user passed memory

On Mon, 29 Feb 2016 01:07 Andrew Kelley, [email protected] wrote:

where do you allocate the memory to do the conversion?


Reply to this email directly or view it on GitHub
#675 (comment).

@felixangell
Copy link
Member

@kiljacken Woops 🌚

@kiljacken
Copy link
Member Author

In relation to my earlier comment on conversion between ark and c string, here is the imagined signatures of the functions we'd want:

// Converts a ark string to a zero-terminated string, in user passed memory
func ToZeroTerminated(src: string, dst: ^u8);

// Wrapper around ToZeroTerminated that allocates memory using mem
func AllocZeroTerminated(src: string) -> ^u8;

// Creates an ark string from a zero-terminated string, using the same backing memory
func WrapZeroTerminated(src: ^u8) -> string;

// Creates an ark string from a zero-terminated string, copying to user passed memory
func FromZeroTerminated(src: ^u8, dst: ^u8) -> string;

// Wrapper around FromZeroTerminated that allocates memory via mem
func AllocFromZeroTerminated(src: ^u8) -> string;

I imagine that these would possibly be (static) methods on string, which would depend on #716.
Naming is very much open to debate.

@felixangell
Copy link
Member

LGTM

On a (semi-related) note, if we were to approve #716, would it be weird to have an alias for a byte pointer that is CString? We could implement these functions on top of CString, and the string type. I think this might be one of the cases where #716 would make things a lot cleaner in this case.

@kiljacken
Copy link
Member Author

That would certainly be nice, from both usability and readability standpoints.

@danharbor95
Copy link

would there be a way to seperate the heap and stack allocation of the strings. you sometimes pass in the data and you have another one that does the allocation for you, its a lot like c

@kiljacken
Copy link
Member Author

@danharbor95 The idea would be that we'd have some way to allocate raw memory on the stack, and then you can use the versions where you pass in a destination pointer.

@danharbor95
Copy link

so like mem alloc but for the stack?

@kiljacken
Copy link
Member Author

@danharbor95 Yes, that's the idea.

@MovingtoMars
Copy link
Member

We don't need both WrapZeroTerminated and FromZeroTerminated. Just using memcpy then WrapZeroTerminated is equivelant to FromZeroTerminated.

@kiljacken
Copy link
Member Author

@MovingtoMars Well, the idea here is ease of use. We might as well provide a function (FromZeroTerminated) that does the strlen and memcpy for the user.

@MovingtoMars
Copy link
Member

I think it's complicating the API for too little gain.

@kiljacken
Copy link
Member Author

@MovingtoMars Well, we'll just have to agree to disagree on this then. I do not think it thing it complicates the API any amount worth worrying about, an thus I believe that the usability gain far outweighs any concern about complexity in this case..

@MovingtoMars
Copy link
Member

I'd be happier to include it if the names were better. Those names aren't distinct enough; it isn't clear what the differences are.

@kiljacken
Copy link
Member Author

@MovingtoMars I already acknowledged that the names were far from final in the original suggestion. They were simply there to get my ideas across.

Naming is very much open to debate.

You're free to help come up with better names.

@felixangell
Copy link
Member

Gonna have to say I agree with @kiljacken here on this one, the naming isn't pretty but I'm sure things will be cleaner. Especially if we approve #716 😉

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

5 participants