Beta FAQs
Below are some questions and answers which might help you get around Kasabi Beta.
Datasets
- What datasets can I use?
- Have you got free datasets available?
- I’ve got some data, can I publish it?
- Can I download a dataset?
- Do you have to host all the data?
- Can I load this CSV/Spreadsheet/Access Database into Kasabi??
- What is Linked Data?
- Where can I get help converting my Data into RDF??
- Do I have to understand SPARQL or RDF to use data in Kasabi?
- Can I query across datasets?
- Can I get everything from the Linked Data cloud in Kasabi?
- How will I be able to load my data into Kasabi?
- Will Kasabi crawl my data?
APIs
- Why would I want to create a Custom API?
- Why do you require API Keys?
- Why API Keys instead of OAuth or other authentication option?
- What options will there be for managing access to a dataset?
Beta
Support
- How do I report a data problem, or share my idea?
- Where can I get help on using your APIs?
- Is there a mailing list for Kasabi Developers?
- I think somebody is using my data in Kasabi without my permission, how can I report this?
Datasets
You can use any dataset that you can get a hold of during the Beta. Each set has its license and terms of use alongside it. As we roll out the marketplace, datasets will each have their own access settings. These are clearly identified on the dataset page.
Have you got free datasets available?
Yes, Kasabi hosts a selection of “open” datasets. Each dataset is clearly marked with its licensing information, and will indicate any charges clearly. Throughout the Beta period, the datasets available will be predominately—if not exclusively—free. Once the marketplace is launched, following the beta trial period, some datasets will remain free. We are working on the commerce options for data providers, who will set charges for dataset use. Some commercial datasets may also be made available free of charge, subject to terms.
I’ve got some data, can I publish it?
Yes, you can. From your dashboard (hint: click on your name at the top of Kasabi), you can create a dataset, configure its licensing, settings and supply developer documentation, and upload data to Kasabi. Kasabi hosts Linked Data, so any RDF file can be uploaded, or you can point at a URI for RDFa data.
No. Kasabi is a hosted service, and downloading data isn’t a feature we’re planning to support. The reasoning behind this is partly for data providers to be able to see how their data is being used, and partly because we see Kasabi’s role in curation as being a valuable aspect of the marketplace. We can’t keep download versions up to date, for example. Data providers may make datasets available for download on their own terms, but not via Kasabi.
Do you have to host all the data?
As with the question above, Kasabi is a hosted marketplace, and the value we add for both developers and data providers is built on our looking after the datasets.
Can I load this CSV/Spreadsheet/Access Database into Kasabi?
Possibly. We’re working on our ingestion features. The best thing you can do is to raise a ticket for our support team, outlining details about the dataset as specifically as possible. We’ll have a look and see what it’d take to get into Kasabi and let you know.
Another thing to look at is Google Refine, and making use of Kasabi’s Reconciliation API.
“Linked Data” refers to the way information is published on the web. With Linked Data, each piece of data is given a URI, allowing it to be linked to and accessed by web applications and services. Linked Data is supported by a set of standards from the W3C for publishing and accessing data. The format for publishing structured data is called RDF, and the SPARQL language is used for querying it. These slides from Talis’ Rob Styles introduce Linked Data, RDF and SPARQL.
Data providers can make use of Kasabi’s Linked Data hosting to make their datasets available to Linked Data developers and applications.
In Kasabi, developers and domain experts can access datasets in a variety of ways, including SPARQL and custom APIs.
Where can I get help converting my Data into RDF?
Depending on the size and nature of your dataset, we may be able to help you from within the Kasabi Team. If not, Talis’ Consulting Team has years of experience in modelling and converting data, and might be able to help you out too. In either case, drop us a line and we’ll see what we can do.
Do I have to understand SPARQL or RDF to use data in Kasabi?
Absolutely not! Kasabi provides a range of data-access options, APIs and even the ability to customise your own API to access data in exactly the way you need it. The full power of SPARQL is there too, of course, but it is not necessary to learn.
As Kasabi is developed and its features are built and polished, it is our aim to build tools which don’t require a full set of developer skills in order to access and work with data. Keep an eye on the Kasabi blog for our vision and updates.
It is not yet possible to query across multiple datasets using SPARQL. One of the things we’re working on, however, is offering the ability to layer datasets together.
Can I get everything from the Linked Data cloud in Kasabi?
Not necessarily. It isn’t our intention to simply mirror freely-available datasets. If a particular ‘set is useful, however, and you would like to see it loaded (see question above), why not raise a support ticket and let us know which set you need. We’ll see what we can do.
How will I be able to load my data into Kasabi?
From the dashboard, you can select files to upload into your dataset, or point Kasabi at a URI for RDFa data. You can also use the dataset management APIS (such as the Update API).
We do have a crawler, but it’s still being house-trained. If you have something you’d like crawled, drop us a line, and we’ll see if he’s ready for you.
APIs
Why would I want to create a Custom API?
A custom API allows you tailor access to the dataset. This custom access will then be suited to your particular application or user community. By creating and maintaining a custom API over the data, you won’t be constrained by the default APIs provided by Kasabi or the data owner.
By allowing the developer community to share its skills in ways other than just creating applications, Kasabi lets us broaden the definition of data curation to cover APIs and access as well as the data itself.
Over time we hope to expand on the range of customisation on offer to put more power into the hands of developers.
An important part of Kasabi is letting data providers explore the potential (commercial and utility) of their data. API Keys let us track the actual usage of each dataset and API, giving us the ability to provide stats to the data providers and curators. With these stats, they are in a better position to understand how their data is being used, and to what extent it’s being picked up.
Why API Keys instead of OAuth or other authentication option?
We haven’t permanently ruled out any authentication facility. Currently, we are using API keys to track usage and authenticate against data usage. We are exploring other options and will be considering them throughout the beta (and beyond).
What options will there be for managing access to a dataset?
Ideally, users will be able to subscribe to a dataset or API within Kasabi itself. We will work with Providers for additional access options including the ability to restrict access to a request. You’ll need to discuss this option with us, as access will default to letting any user subscribe (subject to a dataset’s individual terms and licensing).
Beta
Can I tweet or blog about Kasabi?
Yes, absolutely. Feel free to share your experience with the beta.
Can I store my code on Github?
Yes. You can store code on hosted services. We like and use Github ourselves and are looking at ways to integrate it into Kasabi in future.
Can I invite a friend to the Beta?
Kasabi is open for anyone to register, now that we’ve moved into Open Beta, so yes, you can. Just point them at kasabi.com :).
Support
How do I report a data problem, or share my idea?
The support desk is the best place to bring problems to our attention. From there, you can have a look at the community forums, report problems, and suggest features. You can also raise a support ticket by email our support desk: support@kasabi.zendesk.com.
Where can I get help on using your APIs?
The best place to start is with the API Documentation, which will give you an overview of Kasabi’s Core APIs, and outlines how to build your own too. Each API is accompanied with its own documentation in situ, so you can get all the information supplied for that API on its own Kasabi page.
If you’re still stuck, you can always look for help in the support area, or click that support tab on the left of the Kasabi page.
Is there a mailing list for Kasabi Developers?
You know we wouldn’t have included the question in the list if there weren’t! There is a google group for the developer network (http://groups.google.com/group/Kasabi-dev). You can also join us on IRC (freenode.net) in #Kasabi.
I think somebody is using my data in Kasabi without my permission, how can I report this?
We take the issues of licensing and data ownership very seriously. If you think someone has uploaded data you own the rights to into Kasabi without your permission, or you suspect a breach of licensing terms: contact us at support@kasabi.zendesk.com and tell us as much about the dataset and potential infringement as possible. We’ll get back to you as soon as we can.